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(57) Abstract 



Nucleic acid encoding subunits of a polypeptide folding complex containing TCP-1 is useful in expression of the subunits for assembly 
of the complex in vitro or in vivo. The complex promotes folding of polypeptides, such as mbulin, e,g. following recombinant expression. 
The sequences share a high degree of homology but the C-terminii are sufficiently different to enable peptides with sequences derived 
therefrom to be used in obtaining antibodies specific for the various subunits, i.e. able to distinguish between them. The antibodies may be 
used in analysis and design of complexes able to fold different polypeptides. 
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FOLDING PROTEINS 

. The present invention relates to the folding of 
proteins. In particular, the present invention provides 
proteins, which are able to form a complex in vitro, 
5 useful in facilitating folding of proteins*, for example 
those produced using recombinant DNA technology. Genes 
encoding the proteins of the complex are also provided. 
The present invention further relates to methods of 
assembling a protein complex able to fold proteins in an 
10 in vitro environment. 

Molecular chaperones are known to be able to assist 
in the folding of proteins along the folding pathways 
from denatured state to correctly folded product 
(reviewed in [1] ) . One well studied class of molecular 
15 chaperones are the chaperonins (GroEL, HspGO and Rubisco 
subunit binding protein) found in eubacteria, 
mitochondria and plastids (reviewed in [2,3]). They are 
14-mer double- torus structures composed of one (GroEL and 
Hsp60) or two (Rubisco subunit binding protein) subunit 
20 types and show seven-fold symmetry [4,5]. In vitro these 
chaperonins bind denatured proteins and upon ATP 
hydrolysis, release them into aqueous solution where they 
complete folding [6] . In vivo there is evidence that 
they are involved in the folding, transport and assembly 
25 of newly synthesized proteins. The original mutations 
isolated in groE affected the folding of bacteriophage 
particle subunits [7] but more recent genetic analysis 
suggests a more general role in protein biogenesis in E. 
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coli [8] . Hsp6 0 is involved in the import of proteins 
into the mitochondrial lumen from the cytoplasm [9] . 

Although no GroEL-like chaperonins have been 
identified in eukaryotic cytosol, the double- torus TCP-i- 
5 containing particle seems to be a component of the 

eukaryotic folding machinery and may play an analogous 
role to that of GroEL in eubacteria and the GroEL related 
chaperonins in symbiotic organelles. TCP-i is weakly 
related to the GroEL family [lO] but shows nearly 40% 
10 identity to an archaebacterial chaperonin, TF55 [ii] . it 
has been proposed that GroEL and TCP-l are siibfamilies 
derived from a primordial gene [10,12,13,14] and it has 
been suggested that the eukaryotic TCP-l -containing 
chaperonin may have evolved from an archaebacterial 
15 lineage [3,11] . 

Recently, purified chaperonin containing TCP-i has 
been shown to facilitate the folding of actin [15] and 
tubulin [16,17] in vitro and it binds newly-synthesized 
actin, tubulin and some other unidentified polypeptides 

20 in vivo [18] . One striking difference between the 

bacterially derived chaperonins and the TCP-l -containing 
chaperonin is the heteromeric nature of the TCP-i- 
containing particle [14,15,17,19]. There are at least 
five polypeptide species in the complex containing TCP-l 

25 [14,17]. 

To date, little sequence information has been 
available on the polypeptides which make up the complex, 
despite the fact that various parties have obtained the 



10 
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sequences of peptides from a number of polypeptides of 
TCP complexes of different organisms. Frydman et al (17) 
demonstrated the presence of six subunits in bovine TCP 
complex, which they termed "TRiC" (TCP-1 ring complex) , 
5 and obtained some peptide sequence information indicating 
some resemblance both between polypeptides of the complex 
and between these polypeptides and those of other 
organisms. Rommelaere et al [59] looked at the cytosolic 
chaperonin from both rabbit reticulocyte lysate and 
bovine testis . They report finding eight different 
polypeptides in rabbit reticulocyte chaperonin, and 
obtaining partial amino acid sequences of all eight , 

However, full length clones have proved elusive. 
The full sequence of murine TCP-1 has been available 
15 since 1986 (2 0) and Ehmann et al (FEBS, 33 6: 2, 313-316, 
1993) have reported the obtention of a TCP-1 related 
sequence from Avena sativa (oat) seedlings. Despite this 
information being available, there has yet to be a report 
of the obtention of full-length nucleic acid sequences 
20 encoding the components of a mammalian TCP-1 complex. 
Knowledge of short peptide sequences derived from 
individual subunits of chaperonin containing TCP-l has 
not enabled the specific cloning of the full-length cDNA 
for individual subunits. One problem is that in order to 
25 be sure a peptide sequence is derived from a subunit of 
the family, it must be identifiably homologous to the 
only full-length mammalian sequence available, ie TCP-1 
(20,48) . Any DNA sequences derived by reverse 
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translation from the novel peptide sequence will also be 
related to TCP-l and the related gene sequences. If 
these sequences are used as PGR primers they will prime 
synthesis and amplification of many TCP-i related 
5 sequences, so further insight and activity are needed to 
identify the sequences which encode particular subunits 
of the complex. 

The present invention provides individually seven 
nucleic acid molecules with sequences encoding subunits 
10 of the TCP-l -containing chaperonin, different from the 

original Tcp-1 gene (reported in 20) . Since, in mice, at 
least three of the novel Tcp-l related genes are unlinked 
to the mouse t complex, it is proposed to rename the TCP- 
1 complex [14] as CCT, chaperonin containing TCP-l. Only 
15 now we have all the eight complete sequences of the 

ubiquitously expressed subunits is it possible to know 
the gene and subunit to which each PCR product 
corresponds. Likewise, all other TCP-l related genes in 
the databases make no sense without the complete 
20 sequences being available. 

The present invention also provides molecules which 
are mutants, derivatives or alleles of any one of the 
seven sequences provided, particularly mutants, 
derivatives and alleles which encode a protein which 
25 retains a functional characteristic of the protein 

encoded by the respective wild- type gene, especially the 
ability to associate with at least another subunit to 
form a complex able to fold a polypeptide. Changes to a 
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sequence, to produce a mutant or derivative, may be by 
one or more of insertion, deletion or substitution of one 
or more nucleotides in the nucleic acid, leading to the 
insertion, deletion or substitution of one or more amino 
5 acids. Of course, changes to the nucleic acid which make 
no difference to the encoded amino acid sequence are 
included. We have demonstrated the existence of the 8 
gene sequences in yeast and in plants by hybridization 
(52, unpublished results) and 6 genes have been isolated 
from yeast by ourselves and others (53). These 6 yeast 
genes correspond exactly (ie. they are the orthologues 
of) to six of the genes exemplified in this application 
CCTof, y, 6, e, T? and f. We predict that all 
eukaryotic organisms contain at least the set of eight 
15 genes which we have described in mouse. There may be 

tissue specific CCT genes or additional CCT genes in some 
organisms but we would expect each of these to be closely 
related (greater than 70% amino acid sequence homology) 
to one of the eight genes described here. These eight 
20 CCT genes of mouse constitute the basic family which 

» 

comprise the core CCT complex. In a preferred embodiment 
of the present invention, the sequence is one encoding a 
polypeptide found in a human or a mouse. 

The polypeptides may have an amino acid sequence 
25 which shares a significant degree of homology with any of 
the specific sequences provided herein. Such homology 
may for example, be 60% or greater, 70% or greater, 80% 
or greater, 90% or greater or 95% or greater, provided 
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the polypeptide is able to function as a subunit of a 
complex able to fold a polypeptide. 

The sequences of polypeptides encoded by nucleic 
acid according to each of seven different embodiments of 
5 the present invention are provided in Figure 3, (a) to 

(f) and Figure 8 (h) Preferred nucleic acid sequences are 
shown in Figure 8 (b) to (h) . The present invention also 
provides a vector which comprises nucleic acid with any 
one of the provided sequences, preferably a vector from 
10 which polypeptide encoded by the nucleic acid sequence 
can be expressed. The invention further encompasses a 
host cell transfected with such a vector. The host cell, 
may be bacterial or eukaryotic, such as yeast or 
mammalian eg murine . 
15 According to another aspect of the present invention 

there is provided a method of making a polypeptide 
component (subunit) of a complex, which complex is- able 
to fold a polypeptide, which method comprises expressing 
the polypeptide component from encoding nucleic acid, the 
20 component being other than TCP-l.. (A nucleic acid 
sequence encoding TCP-l is shown in Figure 8(a).) 
Preferably the nucleic acid is one with a sequence shown 
in Figure 8 or a mutant, derivative or allele of such a 
sequence. Appropriate cof actors or accessory materials 
25 may be needed in order for the complex to function in 

some circumstances. Indeed, specificity of the complex 
for folding different polypeptides may be altered 
according to which cofactors are present and/or what 
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combination of polypeptide components ("subunits") is 
used. 

The present invention also provides a polypeptide 
component of a complex able to fold a polypeptide, the 
polypeptide being substantially free of other 
polypeptides or substantially free of other polypeptides 
of the complex, and being other than TCP-l. Preferably, 
the polypeptide component, or a mutant or derivative 
thereof can be found in a mammal, most preferably in a 
mouse or a human. In a preferred embodiment of the 
present invention the polypeptide is one which has any 
one of the amino acid sequences shown in figure 8 (b) to 
(h) (see also Figure 3) . Modified polypeptides (eg 
mutants or derivatives) which have the ability to form a 
complex able to fold a polypeptide are encompassed by the 
present invention. 

The provision for the very first time of full-length 
sequences of nucleic acid molecules which encode 
polypeptides of a complex able to fold a polypeptide 
enables the production of such a complex using 
recombinant techniques. According to another aspect of 
the present invention there is provided a method of 
producing a complex able (under appropriate conditions) 
to fold a polypeptide, the method comprising expressing 
polypeptide components of the complex from encoding 
nucleic acid and causing or allowing assembly of the 
polypeptide components into the complex. Preferably at 
least some of the components are murine or human. 
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Preferably a polypeptide component of the complex is any 
one of those shown in figure 8 (a) to (h) , or a mutant or 
derivative thereof. Polypeptide components of the 
complex may be expressed individually and then purified, 
5 with the assembly carried out in vitro by mixing CCT 

subunits in the appropriate combinations. On the other 
hand, siabunits may be expressed together. 

Also provided by the present invention is a complex 
able to fold a polypeptide, the complex having been 
10 produced using recombinant DNA technology. The use of 

recombinant DNA technology in the production of a complex 
enables an intelligent selection to be made about which 
subunits to include, in those cases where not all of 
those provided are not required, and the easy assembly 
15 and purification of complexes with the required 

biological activity. This is facilitated still further 
by the provision of subunit specific antibodies, 
discussed below. The functional test for a useful 
complex is its ability to fold a polypeptide. Results 
20 shown below indicate that different combinations of 

subunits exist in vivo, Immunoprecipitation experiments 
may be used to determine subunit combinations in purified 
complexes. Furthermore, recombinant production of 
different combinations of subunits and in vitro testing 
25 of ability to bind substrate polypeptide (eg in a 

reticulolysate system) or ability to fold polypeptide (eg 
in an E. coli expression system or in a separate 
combination stage following earlier production of the 
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subunits) enables easy determination of subunit 
combinations which are functional . 

The present invention also encompasses the use of a 
complex in the folding of a polypeptide. Generally the 
5 complex will be one produced using recombinant DNA 

technology. As discussed, the complex may consist of all 
or less than all of the subunits provided herein. 
Similarly, the invention provides a method of folding a 
polypeptide which comprises causing or allowing such a 
10 complex to fold the polypeptide, following a previous 
step of production of the complex. In fact, it may be 
that the complex is produced at the same time as the 
polypeptide to be folded, eg by expression from the same 
vector and/or expression in the same host cell, rather 
15 than in a previous step. 

The provision by the present invention of the full- 
length sequences of the subunits of a folding complex 
enables the production of subunits individually, with 
subsequent purification and combining to form a complex, 
the production of subunits together with combination of 
subunits taking place in the host cell culture (eg E. 
coll, yeast or Baculovirus systems) and with subsequent 
purification, or the expression of combinations of 
subunits with substrate in a host cell system (eg E. 
coli, yeast or Baculovirus systems), thus constructing a 
folding factory in vivo. 

Until now, it has proved problematic to obtain 
antibodies which are specific for an individual subunit 
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from the complex, in the sense of distinguishing between 
one subunit and the others. The reasons for this are 
apparent: the sxibunits share (varying degrees of) 
homology with one another. Accordingly, attempts to 
5 obtain subunit specific antibodies have failed. 

Immunisation of an animal with a purified subvmit causes 
the production of antibodies which, in the most part, are 
cross -reactive with a number of subunits. Now that the 
present application has provided the full sequences of 
the subunits for the first time, it has proven . possible 
to identify regions of the subiinits which are 
sufficiently different from corresponding regions in the 
other subunits to enable the production of subunit 
specific antibodies. 

Thus, according to another aspect of the present 
invention there is provided an antibody specific for a - 
CCT subunit (subunit of a complex able to fold a protein) 
other than TCP-l (CCTo;) , preferably a mammalian subunit, 
such as human or murine. In a preferred embodiment, the 
antibody is specific for an epitope of a peptide 
corresponding to a C-terminal portion of a subunit, or a 
variant of such a peptide (modified by any of insertion, 
substitution or deletion of one or more amino acids) . 
Most preferred peptides for this purpose are as follows: 
25 APRKRVPDHHPC (CCTbeta) SEQ. ID NO. 1 

NRQTQAPDAGQE (CCTgamma) SEQ. ID NO. 2 

SILKIDDWNTR (CCTdelta) SEQ. ID NO. 3 

IDDIRKPGESEE (CCTepsilon) SEQ. ID NO. 4 



15 



20 
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SAGRGRGQARFH 



(CCTeta) 



SEQ. ID NO. 5 



SGKKDWDDDQND 



(CCTtheta) 



SEQ. ID NO. 6 



EIMRAGMSSLKG 



(CCTzeta) 



SEQ. ID NO. 7 



These peptides (including any- 



variant modified in any 



10 



15 



20 



way, provided the subunit specific nature of an antibody 
raised to the variant is retained) each represent an 
aspect of the present invention. 

According to a further aspect of the invention there 
is provided a method of obtaining an antibody specific 
for a CCT subunit, other than TCP-i (CCTa) , which method 
comprises immunising a mammal (eg mouse, rat, rabbit, 
horse, goat, sheep or monkey) with a peptide of the 
subunit. Preferably, the peptide is one of those listed 
above, or a peptide which is a variant of one of those 
listed, provided subunit specificity of an antibody which 
binds the peptide variant is retained. Antibodies may be 
obtained from immunised animals using a variety of 
techniques known in the art, and screened, preferably 
using binding of antibody to antigen of interest (one of 
the C-terminal peptides, for example) . For instance. 
Western blotting techniques or immunoprecipitation may be 
used (Armitage et al (1992) Nature 357: 80-82). 

As an alternative or supplement to immunising a 
mammal with a peptide, an antibody specific for a subunit 
may be obtained from a recombinantly produced library of 
expressed immunoglobulin variable domains, eg using 
lambda bacteriophage or filamentous bacteriophage which 
display functional immunoglobulin binding domains on 
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their surfaces; for instance see WO92/01047. The library 
may be "naive", that is constructed from sequences 
obtained from an organism which has not been immunised 
with any of the peptides, or may be one constructed using 
5 sequences obtained from an organism which has been 
exposed to the peptide or peptides of interest. 

Antibodies according to the present invention may be 
modified in a number of ways. Indeed the term "antibody" 
should be construed as covering any binding member having 
10 a binding domain with the required specificity. Thus the 
invention covers antibody fragments, derivatives, 
functional equivalents and homologues of antibodies. 

Example antibody fragments, capable of binding an 
antigen or other binding partner are the Fab fragment 
15 consisting of the VL, VH, CI and CHI domains; the Fd 
fragment consisting of the VH and CHI domains; the Fv 
fragment consisting of the VL and VH domains of a single 
arm of an antibody; the dAb fragment which consists of a 
VH domain; isolated CDR regions and F(ab')2 fragments, a 
20 bivalent fragment, comprising two Fab fragments linked by 
a disulphide bridge at the hinge region. Single chain Pv 
fragments are also included. 

A derivative is a substance derived from a 
polypeptide or antibody. The derivative may differ from 
25 a polypeptide from which it is derived by the addition, 
deletion, substitution or insertion of one or more amino 
acids, or the linkage or fusion of other molecules to the 
polypeptide. Changes such as addition, deletion. 
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substitution or insertion may be made at the nucleotide 
or protein level . 

A hybridoma producing a monoclonal antibody 
according to the present invention may be subject to 
5 genetic mutation or other changes. It will further be 
understood by those skilled in the art that a monoclonal 
antibody can. be subjected to the techniques of 
recombinant DNA technology to produce other antibodies or 
chimeric molecules which retain the specificity of the 
10 original antibody. Such techniques may involve 

introducing DNA encoding the immunoglobulin variable 
region, or the complementarity determining regions 
(CDRs) , of an antibody to the constant regions, or 
constant regions plus framework regions, of a different 
15 immxinoglobulin. See, for instance, EP184187A, GB 

2188638A or EP-A-0239400 . Cloning and expression of 
chimeric antibodies are described in EP-A-0120694 and EP- 
A-0125023 . 

The peptides for use in obtaining antibodies may be 
20 made by any of a variety of techniques known to those 

skilled in the art. For instance, this may be by solid 
phase synthesis (Merrifield, JACS 85: 2159-2154 (1963)) 
or by techniques described in Bodanszky et al , Peptide 
Synthesis, second edition (Wiley, 1976) . Standard 
25 solution peptide synthesis methodologies, using chemical 
or enzymatic methods of amide bond formation may be 
employed. Commercial peptide synthesizing machines are 
available . 
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Conveniently, an amino terminal cysteine can be 
added to the chosen peptide in order that the peptide can 
be coupled to PPD through the cysteine using standard 
chemistry. 

5 Systems for cloning and expression of a polypeptide 

in a variety of different host cells are well known. 
Suitable host cells include bacteria, mammalian cells, 
yeast and baculovirus systems. Mammalian cell lines 
available in the art for expression of a heterologous 
10 polypeptide include Chinese hamster ovary cells, HeLa 
cells, baby hamster kidney cells and many others. A 
common, preferred bacterial host is E. coli. 

Suitable vectors can be chosen or constructed, 
containing appropriate regulatory sequences, including 
promoter sequences, terminator fragments, polyadenylation 
sequences, enhancer sequences, marker genes and other 
sequences as appropriate. For further details see, for 
example. Molecular Cloning: a Laboratory Manual: 2nd 
edition, Sambrook et al, 1989, Cold Spring Harbor 
Laboratory Press. Transformation procedures depend on 
the host used, but are well known. 

Baculovirus expression systems are commercially 
available in kit form, eg MaxBac™. The techniques 
employed are described in Summers and Smith, Texas 
25 Agricultural Experiment Station Bulletin 1555 (1987) . 
See also The Molecular Biology of Baculoviruses (ed 
Doerfler, 1986) for a general overview of Baculovirus. 
Baculovirus expression vectors have been developed for 
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infection into a number of cell -types, including those 
from Aedes aegypti, Autograph calif ornlca, Bombyx mori, 
Drosophxla melanogaster, Spodoptera frugiperda and 
trichoplusia ni, 
5 Embodiments of the present invention are now 

described further by way of illustration, with reference 
to the figures identified below. All documents mentioned 
in the text are hereby incorporated by reference. 
Figure 1 shows one- and Two-dimensional gel 
10 electrophoresis of mouse CCT and bovine TRiC, (a) SDS- 
PAGE analysis (8% gel) of mouse testis CCT. TCP-l- 
containing particles purified by sucrose gradient 
fractionation followed by ATP-affinity column 
chromatography from mouse testis [14] were 
15 electrophoresed. (b-c) 2D-PAGE analysis of mouse CCT 
(same sample as Figure la) and bovine TRiC [17] (Jcind 
gift of F.-U. Hartl) . Subunits of TCP- 1 -containing 
particles purified by sucrose gradient fractionation 
followed by ATP-affinity column chromatography from mouse 
20 testis (b) and bovine testis (c) were separated by lEF 
followed by SDS-PAGE. (d) 2D-NEPHGE analysis of bovine 
TRiC. Subunits of bovine TRiC were separated by NEPHGE 
followed by SDS-PAGE. (e) 2D-NEPHGE analysis of 
antibody-affinity purified CCT. TCP-l-containing 
25 particles were purified from ^^S-labelled F9 cell extracts 
using monospecific monoclonal antibodies to TCP-1. 
Proteins were visualized by silver staining (a-c) , 
Coomassie staining (d) or autoradiography (e) . Arrowhead 
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indicates a 45 kDa protein of pi 6.5 which co-purified 
with F9 cell CCT. Arrows show Hsp70 proteins, 
(f) Molecular weights and pi values of the polypeptides 
comprising mouse CCT and bovine TRiC. These values are 
5 determined from 2D-gel analysis with both pi and 
molecular weight markers. 

Figure 2 shows an alignment of peptide sequences 
deduced from 5' -end DNA sequences of 16 Tcp-1 related 
genes (SEQ. ID NO's 8-23). Clone names are used except 
for Tcp-1 genes of mouse, human and yeast in this figure. 
Letters in parentheses indicate the source species: M, 
Mas musculus (mouse) ; H, Homo sapiens (human) ; C. 
Caenorhabditis elegans (nematode) ; S, Saccharomyces 
cerevisiae (yeast) . Sources of the nucleotide sequences 
15 are as follows: mouse Tcp-1 [20,48]; human Tcp-1 [48]; 
yeast Tcp-1 [27]; 4950 [28]; pGl-pG4 (partial 5' -end 
sequences) [22] and this report; p383 and p384 (partial 
5' -end sequences), pTS2, pTy?, pT62, pTe5, pTfl2 and 
pCBLSO, this report. Although three additional human 
2 0 rcp-1 related partial DNA sequences, pAP3 (A. Malik et 

al., manuscript in preparation), HTR3 [41] and IB713 [49] 
are known, these sequences are not added to this figure 
• due to absence of 5' -end sequences (pAP3 and HTR3) or 
inaccuracy of sequence (IB713) . However, judging by DNA 
25 sequence homology, IB713, pAP3 and pHTR3 are members of 
groups 2, 5 and 6, respectively. 

Figure 3 shows amino-acid sequences of S, y, 6, e, f 
and ri subunits of mouse CCT. Amino-acid sequences of 
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mouse CCTfi (a) (SEQ. ID NO. 25), CCTy (b) (SEQ. ID NO. 
26), CCT6 (c) (SEQ. ID NO. 27), CCTe (d) _ (SEQ. ID NO. 
28), CCTf (e) (SEQ. ID NO. 29) and CCTt; (f) (SEQ. ID NO. 
30) deduced from nucleotide sequences of cDNA clones 
5 pTlS2, pTY7, pT62, pTe5, pTfl2 and pCBLSO respectively. 

Sequences of tryptic peptides derived from B1-B4 of mouse 
CCT (Figure la) are shown by bold underlining. Thin 
underlines indicate amino acids identical to those of 
bovine P1-P3 and P5 ([17] and Figure 1) determined from 
10 tryptic polypeptides of the TRiC subunits ( [17] , and 

unpublished results of F.U. Hartl et al . ) . An amino-acid 
position not determined by peptide sequencing is 
indicated by an 'x' . A bovine tryptic peptide sequence, 
P5/T36, contains 3 amino-acid differences compared to the 
15 mouse sequence and these are shown underneath. 

Figure 4 shows alignment of amino acid sequences of 
eight mouse CCT subunits and the homo-oligomeric 
archaebacterial chaperone TF55. A dash indicates an 
amino-acid gap. Conserved amino acids are indicated by 
20 bold characters. Consensus amino acids of these proteins 
are shown under the alignment (SEQ. ID NO. 33) . The 
amino acid sequence of CCTO is SEQ. ID NO. 31) . The 
amino-acid sequences of mouse CCTor (TCP-1) [20,48] (SEQ. 
ID NO, 24) and archaebacterial chaperone TF55 of 
25 Sulfolobus shiJbatae [11] (SEQ. ID NO. 32) are derived 
from previous publications. 

Figure 5 shows comparison between a consensus motif 
of CCT subunits and the ATP binding motif of cAMP 
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dependent kinase and other kinases in this family. 
Amino-acid secpaences around a consensus motif of CCT 
subunits (positions 102-115 in Figure 4) and the ATP 
binding motif of cAMP dependent kinase and related 
5 kinases [25,26] are compared. Amino acids conserved 
between these two groups are indicated by bold 
characters. (CCTa-SEQ. ID NO. 34, CCT/S-SEQ. ID NO. 35, 
CCT-y-SEQ. ID NO. 36, CCT6-SEQ. ID NO. 37, CCTe-SEQ. ID 
NO. 38, CCTf-SEQ. ID NO. 39, CCT77-SEQ. ID NO. 40, CCT^- 
10 SEQ. ID NO. 41) cAPK-a, cyclic -AMP -dependent kinase a, 

(SEQ. ID NO. 42); PKC-or, protein ^kinase c-a (SEQ. ID NO. 
43); CaMII-a, bovine calcium- calmodulin-dependent kinase 
type I la, (SEQ. ID NO. 44) SNFl, a budding-yeast wild- 
type gene product for sucrose non- fermenting mutant (SEQ. 
15 ID NO. 45); cdc2*, a f ission-yeast cell-division-cycle 
gene product (SEQ. ID NO. 46); CDC7, a budding-yeast 
cell -division-cycle gene product (SEQ. ID NO. 47); Raf, a 
human cellular homologue of murine sarcoma virus product 
(SEQ. ID NO. 48); Src, a human cellular homologue of Rous 
20 avian sarcoma virus product (SEQ. ID NO. 49) ; Abl, a 
human cellular homologue of murine leukaemia virus 
product (SEQ. ID NO. SO); EGFR, human epidermal growth 
factor receptor (SEQ. ID NO. 51) ; INSR, human insulin 
receptor (SEQ. ID NO. 52); PDGFR, mouse platelet -derived 
25 growth factor receptor (SEQ. ID NO. 53) . 

Figure 6 shows hybridization analysis of the Cct 
genes of mouse and yeast. (a) Southern analysis of mouse 
Cct genes. Mouse 129/Sv liver DNA (10 fig/lane) was 
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digested with Hindlll, electrophoresed on a 0.7% agarose 
gel and blotted onto a nylon membrane. The membrane was 
cut into 7 pieces and each of them hybridized with ^^P- 
labelled mouse Ccta/Tcp-l (clone pTlbll) [33] , Cctb 
5 (pTS2) , Cctgr (pT77) , Cctd {pT62) , Ccte (pTe5) , Cctz 
(pTri2) and Ccth (pCBLSO) 1.5kb cDNA probes. These 
membranes were washed in 0 . Ix SSC supplemented with 0 . 1% 
SDS at 65**C. (b) Southern analysis for yeast homologues 
of Cct genes with mouse cDNA probes, Saccharomyces 

10 cerevisiae DNA (1.5 fig/lauB) was digested with PstI, The 
DNA was electrophoresed, blotted and hybridized as 
described in panel (a) . After the hybridization, these 
membranes were washed in 2 x SSC supplemented with 0.1% 
SDS at 58.5^C. Positions of molecular weight markers are 

15 shown on the left side of each panel . 

Figure 7 shows the evolutionary tree of CCT subunits 
of eukaryotes and the homo- oligomer ic chaperonin of the 
archaebacterium Sulfolobus shibatae, TP55 . An 
evolutionary tree based on amino acid substitutions is 

20 constructed by the neighbour- joining method [50] with the 
aligned amino-acid sequences numbers 18-67, 70-157, 169- 
200, 213-241, 251-272, 274-310, 326-372, 376-388, 398-505 
and 521-566 (472 amino acids for each subunit) of mouse 
CCT subunits and archaebacterial chaperone TF55 in Figure 

25 4, and corresponding amino acid sequences of other 

homologues. Amino-acid sequences of human [48], fruit 
fly [51] , plant [29] and yeast [27] homologues of 
CCTa/TCP-1 were derived from previous publications. The 
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amino-acid sequence of the yeast homologue of CCTS was 
obtained from Miklos et ai . [28]. Capitals in 
parentheses indicate source species: MM, Mus musculus 
(mouse); US, Homo sapiens (human); DM, Drosophila 
5 melanogaster (fruit fly) ; CE, Caeiiorhajbdi tis elegans 
(nematode) ; AT, Arabidopsia thalxana (plant) ; SC, 
Saccharomyces cerevisiae (yeast) ; SS, Sulfobus shibatae 
(archaebacterium) . 

Figure 8 shows full nucleotide sequences encoding 
10 subunits of a complex, in this case mouse CCT cDNAs . The 
polypeptide sequences of the predicted open reading 
frames are given under each DNA sequence. Figure 8(a) is 
the sequence of Tcp-la/Ccta (SEQ. ID NO. 54). Figures 8 
(b) (SEQ. ID NO. 55), (c) (SEQ. ID NO . 56), (d) (SEQ. ID 
15 NO. 57), (e) (SEQ. ID NO. 58), (f) (SEQ. ID NO. 59), (g) 
(SEQ. ID NO. 60) and (h) (SEQ. ID NO. 61) are the 
sequences of the Cctb, Cctg, Cctd, Ccte, Cctz, Ccth and 
Cctq genes respectively. 

Figure 9 shows immunoprecipitation of CCT complexes 
20 from mouse F9 cells, precipitated under native conditions 
(14) using anti-peptide antibodies specific to CCTS, 
CCT7, CCTe and CCT,;. A negative control shows absence of 
signal using an anti-peptide antibody to 

Schizosaccharomyces pombe CCTor, which does not recognise 
25 any mammalian subunits. 

Figure 10 2D-PAGE analysis of CCT. Subunits of ATP- 
affinity purified CCT from mouse testis were separated by 
2D-PAGE and proteins were visualised by silver staining 
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(A) . Greek letters indicate the eight subunit species 
whose genes have been cloned, and S6 shows the testis- 
expressed subunit. A novel co-purifying 63kDa protein of 
pi 6.93 is indicated by an arrow and hsp 70 proteins are 
5 shown by arrowheads. CCT sub\inits were immunoblotted 
with rabbit antibodies against carboxy terminal amino 
acid sequences of CCTd (B) , CCTc (C) , CCT^ (E) , CCTy (F) , 
CCTf (G), CCTt? (H), CCT6 (I), and a monoclonal antibody, 
91a, against CCTo; (D) . Panel J shows CCT subunits 
10 immunoblotted with a rabbit antibody to a chaperonin 

consensus sequence thought to be involved in ATP-binding. 
Sequences of the peptide immunogens are shown in Table 2. 
In all panels the acidic side is to the left. 

Figure 11 Analysis of native populations of CCT. 
15 Partially purified CCT from mouse testis was subjected to 
non- denaturing isoelectric focusing followed by SDS- 
PAGE. Proteins were visualised by silver staining (A) . 
Native populations of CCT resolved by non -denaturing lEF 
are indicated by I and II and arrowheads show hsp 70 
20 proteins. Native populations of CCT were immunoblotted 
with a rabbit antibody against CCTe (B) , a BOkDa 
proteolytic fragment of CCTe is indicated by an arrow, or 
a monoclonal antibody against /3-tubulin (C) . in all 
panels the acidic side is to the left. 
25 FIGURE 12 shows in vitro translation products 

produced from individual CCT cDNA clones. Analysis of 3 5 
S- labelled CCT subunits synthesized in rabbit 
reticulocyte lysate from individual CCT cDNA clones. 
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Aliquots of the in vitro translation reactions are 
electrophoresed on 10% SDS-PAGE and visualized by 
autoradiography. Lanes: (l) CCTa, (2) CCT/3, (3) CCTy, 
(4) CCT6, (5) CCT£, (6) CCTf, (7) CCT77, (8) CCTS . M- 
5 molecular weight markers. 

PIGDRE 13 shows detection of labelled CCT complex. 
Figure 13A shows an autoradiogram of a 10% SDS- 
. polyacrylamide gel of 35S-labelled CCT proteins detected 
in fractions from a sucrose density grandient analysis of 
CCT assemblec in vitro in rabbit reticulocyte lysate. 
Eight CCT sxibunit mRNAs, CCTa-CCTfi were translated 
together and the reaction mix applied to a i0%-40% 
sucrose gradient as previously described (14) . Lane M = 
marker proteins, lanes 1-14 contain 30 microlitre 
15 aliquots of 1ml sucrose fractions (fractions 17-4) . 
Fraction 17 is the first, lightest fraction of the 
gradient. Free CCT subunits are observed in fractions .16 
and 15 (lanes 2 and 3) and the assembled complex is found 
in fractions 12 and 11 (lanes 6 and 7) . 
20 Figure 13B shows an autoradiogram of a 10% SDS- 

polyacrylamide gel of 35S-labelled CCT proteins 
immunoprecipitated from sucrose gradient fractions by 
anti-CCT^ antibodies. This analysis demonstrates the 
presence of assembled CCT in sucrose fractions 12 and 11, 
25 but not in fractions 16 and 15 . Immunoprecipitation of 
aliquots of fractions from the same sucrose gradient as 
in Figure 13A. 300 microlitre aliquots of each of 
gradient fractions 17-9 were immunoprecipitated in a iml 
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reaction containing 5 microlitres of anti-CCTjS 
antibodies. Immune -complexes were recovered by binding 
to protein-A sepharose beads. Precipitation of fractions 
16 and 15 with anti-CCT^ antibodies only recovers CCT^S 
5 from the mixture of 8 free, unassembled CCT subunits in 
the light fractions. However, precipitation of fractions 
12 and 11 recovers all subunits consistent with the 
notion that these sucrose fractions contain assembled CCT 
of expected size and sedimentation properties. 
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Summary of materials and methods used in obtaining gene 
sequences . 

Purxfi cation of CCT 

Mouse testis CCT was purified by sucrose gradient 

15 fractionation followed by ATP-agarose affinity 

chromatography as previously described [14] . Bovine 
testis TCP-l ring complex (TRiC) [17] was a kind gift of 
F-U Hartl. Mouse F9 cells were labelled with ^^S- 
methionine and ^^S-cysteine for 5h and CCT was purified by 

20 immunoaffinity chromatography using anti-TCP-1 monoclonal 
antibodies 84A and 91A [14,21]. Isoelectric focusing 
(lEF) [43] and non- equilibrium pH gradient 
electrophoresis (NEPHGE) [44] were carried out according 
to Corbet t and Dunn [4 5] . 

25 Peptide sequencing of CCT subunits 

CCT was subjected to electrophoresis on an 8% SDS- 
PAGE gel and protein bands were stained with 0.1% 
Coomassie Brilliant Blue in 0.5% acetic acid/10% aqueous 
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methanol and destained with 10% aqueous methanol. Bands 
as shown in Figure la were excised and the proteins were 
electroeluted from the gel slices. The proteins were 
concentrated onto polyvinylidene difluoride (PVDF) 
5 membranes using ProSpin cartridges (ABI) and each sample 
was digested with trypsin for I8h at 3 7«C using the 
method described by Fernandez et al. [46] . Digestion was 
terminated with trif luoroacetic acid (TFA) and tryptic 
peptides were isolated from the PVDF supematants using 
an ABI 13 OA microbore separation system. TFA buffers 
(A: 0.1% TFA; B: 0.085% TFA, 70% acetonitrile) and a 
simple linear gradient (5 to 100% B in 50 min; 200 fil per 
min) were employed on a 2.1 x 100 mm Brownlee Aquapore 
RP-300 column. Sequence analysis on the isolated 
15 fragments was performed using an ABI 477A protein 

sequencer with on-line PTH detection and data handling. 

Eight TRiC peptide sequences were piiblished by 
Frydman et al. [17] and a further TRiC peptide sequence 
(P2) was a kind gift of F-U. Hartl . 
20 C. elegans Tcp-1 related cDNA clones 

Nine Tcp-1 related cDNA clones were reported from 5'- 
end single-pass sequencing of randomly selected C. 
elegans cDNA clones [22] . We compared these 9 partial 
sequences and found that they were derived from 4 
25 independent genes. Because the cDNAs were all 

direct ionally cloned, we subcloned the cDNA having the 
longest 5' region for each of the four genes (cmOBglO, 
cml2bl0, cmlld3 and cml3e8) . These pBluescript KS* clones 
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were renamed pGl , pG2 , pG3 and pG4 and represent 1 . 8 kb 
inserts for each of these four C, eiegrans genes. We 
accurately sequenced the 5' ends of these plasmid. 
subclones and deduced 90 amino acids of NH2- terminal 
sequence for each gene (Figure 2). The deduced peptide 
sequences of pGl, pG2, pG3 and pG4 have 74%, 31%, 52% and 
48% identity to mouse TCP-1 [20] at their NHj termini . 
Human Tcp'l related cDNAs 

A cDNA library of the human cell line HT1080 (kindly 
given by P, Mitchell) contains cDNA inserted at the EcoRI 
site of bacteriophage lambda ZAP II . These phages (1200 
pfus) were plated onto each of 24 plates and phage pools 
of each of them were made by incubating each lawn in 5 
mis of SM buffer overnight. ^ Approximately 1.5 mis of 
pooled phage stock was obtained from each plate. A mixed 
primer for the peptide sequence TNDGATI (a motif 
conserved between TCP-1 and TF55) was designed (SEQ. ID 
NO. 62) : 

5 ' -AT (AG) GT (AGT) GC (ACT) CCATC (AG) TT (AGT) GT-3 ' . 

In and out PGR using the specific primer and a primer 
of the lambda ZAPII polylinker on the 5' side of the 
inserts was performed. PGR reactions (60 raM KCl, 15 mM 
Tris-HCl pH8.8, 2.25 mM MgCl^, 250 fiM each dATP, dGTP, 
dCTP and dTTP, 0.4 pmol//il each primer, 25 ^l/tube) were 
prepared and then 1 /xl of the phage stock and 1.5 unit 
Taq polymerase was added to each tube. These were 
subjected to 30 cycles of 93 °C for 15 sec, 45«C for 15 
sec and 72*=»C for 30 sec. PGR products from 20 of the 
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pools gave one or only a few 150-300 bp bands by 1% 
agarose gel electrophoresis. These bands were excised 
from the gel, reamplified and sequenced. Three Tcp-l 
related sequences were identified, subcloned into 
5 pBluescript KS* and named p383, p384 and phl3 . The 

deduced polypeptide sequence from p3 83 has 28% identity 
to mouse TCP-1 over its NH2-terminal 54 amino acids 
(excluding the primed sequence TNDGATI (SEQ. ID NO. 63)), 
that from p3 84 has 41% identity to mouse TCP-l over its 
10 NH2- terminal 51 amino acids (excluding TNDGATI) and that 
from phl3 has 24% identity to mouse TCP-1 over its NH2- 
terminal 65 amino acids (excluding TNDGATI) . 

A partial cDNA clone, pAP3 , was isolated from an 
adult human kidney cDNA library by hybridisation with a 
15 PGR fragment generated using redundant primers 

corresponding to Na/K transporter ATPase sequences 
(Malik, manuscript in preparation) . Upon sequencing, 
pAP3 turned out to contain a 1050 base pair insert 
encoding a Tcp-l related gene. Total mouse brain cDNA 
2 0 was subjected to PGR using two primers from the human 
pAP3 sequence : 

forward 5 ' -ACATGCAGCAGCTCTGTGAG-3 ' ( SEQ , ID NO . 64 ) 

and 

reverse 5 ' -CCTTGCCTAGCACTCACTCC-3 ' (SEQ, ID NO. 65). 
25 An 800 bp fragment of the mouse orthologue of pAP3 gene 
was generated. 

Jsolation of mouse Tcp-l related cDNA clones 

The mouse full-length cDNA clones encoding 6 novel 
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TCP-1 related proteins were cloned as follows. An F9 
embryonal carcinoma cell cDNA library was made in lambda 
ZAP by the method of Nagata et al . [47] and 7.5x10* size- 
selected recombinants were obtained from 1 fig poly (A) * 
5 RNA. This library was transferred onto GeneScreen Plus 
(NEN) membranes and duplicate filters were sequentially 
probed with the following five Tcp-i related cDNA probes. 
The two C. elegans cDNAs, pG2 and pG3 , the three human 
cDNAs, p383, p384 and phl3 and the 800 bp mouse RT-PCR 
10 product from pAP3 were "P- label led and hybridized to the 
membranes in Southern hybridization (SH) buffer (6x SSPE, 
5x Denhart's reagent, 0.5% SDS, 10 mg/ml salmon sperm 
DNA) overnight at 55 °C for the two C. elegans probes, 
60«>C for the three human probes, and 65«>C for the mouse 
15 probe. Between 6-12 positively hybridizing clones were 
purified for each probe. The longest clone (1.8-2.2 kb 
inserts) derived from each screening was named as pTlS2 
(p383 homologue) , pTe5 {p384 homologue) , pTfil {phl3 
homologue) , pTfl2 (pG2 homologue) , pT62 (pG3 homologue) 
20 and pTY7 (pAP3 homologue) . The relationship between 

these probes and clones is described in Figure 2. The 
sixth mouse Tcp-1 related cDNA was recovered during a 
pilot experiment for a mouse testis cDNA sequencing 
project. Ninety-five random mouse testis cDNAs 
25 directionally cloned in lambda ZAPII were single pass 

sequenced from their 5' ends. The sequence of the 80th 
CDNA revealed a new Tcp-1 related gene which is different 
to all the others described above and this plasmid. 
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pCBLSO, was completely sequenced. 
DNA sequencing and computer analysis 

The nucleotide sequences of all the Tcp-l related 
genes, except pCBLSO, were determined by the 
5 dideoxynucleotide chain termination method with fluor- 
escent ly labelled primers using the PRISM kit and a 3 73A 
automated sequencer (ABI) . Clone pCBLSO was manually 
sequenced by dideoxynucleotide chain termination and ^^S- 
dATP using a combination of deletion clones and specific 

10 primers. All these sequences and deduced amino-acid 

sequences were analysed by UWGCG programs on the Silicon 
Graphics Crimson network at the SERC, Daresbury, UK. 
Phylogenetic analyses were also carried out at Daresbury 
using the PHYLIP collection of programs. 

15 Southern blotting 

Mouse strain 129/Sv DNA (10 /zg/lane) and S. 
cerevisiae strain 3a DNAs (1.5 fig/lane) were digested 
with Hindlll and electrophoresed on 0.7% agarose gels and 
blotted onto GeneScreen Plus membranes. Probes of 1500 ± 

20 3 bp length were made by PCR with the mouse Tcp-1 cDNA 

(pTlbll) [33] and mouse Tcp-1 related cDNAs (pTS2 , pT 7, 
pT62, pT€5, pT 12 and pCBLSO) . These 7 probes were 
labelled with ^^P and hybridized to the blotted membranes 
in SH buffer overnight at 65 ^C (mouse) or at 55 ^C 

25 (yeast) . The membranes were washed either in O.lx SSC 
containing 0.1% SDS at 65°C (mouse) or in 2x SSC 
containing 0.1% SDS at 58.5«>C (yeast). 
Results 
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Subunit composition of the chaperonin containing TCP-l 
(CCT) 

Two-dimensional gel analysis of the subunits of the 
CCT reveals 7-9 spots of 52-65 kDa. Figure 1 shows SDS- 
5 PAGE, 2D-PAGE and 2D-NEPHGE comparison of CCT isolated 
from mouse testis and F9 embryonal carcinoma cells, and 
bovine testis TCP-l ring complex (TRiC) [17] . 

Excluding the Hsp70 species which co-purify 
specifically with the CCT [14] , the testis preparations 
10 contain 9 major species (Figure Ib-d) and the F9 cell 

preparation contains seven 53-65 kDa species excluding a 
45 kDa protein (Figure le) . This represents an increase 
in the number of species reported by Lewis et ai. [14] 
because two species (S8 and" S9) , with basic pis, resolve 
15 poorly in the lEF dimension of 2D- PAGE run to 

equilibrium. A 2D-NEPHGE analysis of bovine TRiC clearly 
shows 9 subunit species (Figure Id) . 

We had previously shown, using a monoclonal antibody 
{91A) [21] which recognizes only a single subunit (S3, 
20 TCP-l) of the CCT, that antibody-affinity purified CCT 

has a similar electrophoretic profile to CCT purified by 
ATP-agarose affinity or ion-exchange chromatography, on 
SDS-PAGE gels [14] . Figure le shows a 2D-NEPHGE analysis 
of antibody-affinity purified, CCT from F9 cells; the 
25 polypeptide spot profile strongly resembles CCT 

biochemically purified from testis (Figure la,b) and 
seven 52-65 kDa species are distinguishable by the 2D 
analysis • A 45 kDa protein of pi 6.5 was co-purified 



BNSOOCID: <WO 9520654A1> 



wo 9S/206S4 PCT/GB95/00192 

30 

with CCT by the antibody-affinity method, suggesting a 
novel co-factor or substrate of CCT (arrow head in Figure 
le) . These results confirmed that TCP-l comprises a 
hetero-oligomeric complex with other polypeptides. 
5 Figure If shows a tabulation of the apparent 

isoelectric points and molecular weights of the CCT 
subunits. Frydman et al. [17] reported that bovine 
testis TRiC migrated as 5 bands on a 12.5% SDS-PAGE gel 
(P1-P5) . We found that band P4 contained 5 species of 
10 polypeptides which were distinguishable from one another 
on a 2D gel, and we named them P4.1, 4.2, 4.3, 4.4 and 
4.5 according to ascending pi. The subunits of mouse CCT 
(S1-S9) are numbered according to ascending pi using the 
mouse testis pattern as the standard. These bovine and 
15 mouse spots are distributed from pH6.1 - pH7.1 and MW 65 
kDa - MW52 kDa except for the most basic spot (NEPHGE 
analysis suggests a value of around pH 7.4 for the pi of 
this most basic spot) . 

Isolation of mouse cDNAs encoding seven novel TCP-l 

20 related proteins 

We previously reported [14] the existence, in various 
eukaryotes, of numerous genes and partial cDNA sequences 
related to the original mouse Tcp-i gene [20] , the first 
identified gene encoding an CCT subunit [14,17] . it 

25 proved necessary to use a combination of methods to 

isolate murine Tcp-l related genes and clone the full- 
length cDNAs which encode the seven novel TCP-l related 
proteins described herein. The first and second 
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(clones pT62 and pTfl2) were isolated by cross - 
hybridization with Caenorhahditis elegans cDNA probes 
recovered from the 5' expressed sequence tag collection 
of Waterston et ai . [22]. 

The third, fourth and seventh (clones pTfi2, pTeS and 
pTdl) were isolated by cross -hybridization with human 
probes recovered by PGR of human HT1080 cell line cDNA 
using degenerate primers from a conserved region of TCP-l 
and TF55. 

The fifth gene (clone pCBLSO) was isolated during a 
mouse testis cDNA sequencing project at the Chester 
Beatty Laboratories, London. A cDNA fragment of a Tcp-l 
related gene (clone pAP3) was recovered accidentally from 
a human kidney cDNA library ' during a screen for ion 
15 transport channel genes (Malik et al . , manuscript in 

preparation) and the sixth murine gene (clone pTy?) was 
recovered by hybridization with a mouse cDNA PGR product 
which was made with primers derived from the sequence of 
this human cDNA. 
20 Figure 2 shows the very conserved NH2- terminal domain 

deduced from these various Tcp-l and Tcp-l related genes 
of mouse, human, C. elegrans and Saccharomyces cerevisiae. 
A phylogenetic analysis (UPGMA) [23] based on the amino- 
acid sequences of the domain showed seven groups of genes 
25 (data not shown) , suggesting that in eukaryotes Tcp-l and 
the Tcp-l related genes have each evolved independently 
since they diverged. Analysis of the eight members of 
the Cct gene family suggests that the heteromeric GGT 
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particle had already evolved in a common ancestral 
organism of animals, plants and yeast more than 500 
million years ago and that different subunits may have 
different functions in protein folding.. 
5 Figure 3 and Figure 8 show amino-acid sequences of 

the seven mouse TCP-l related proteins deduced from their 
full-length cDNA clones. 

Figure 8 (b) to (h) shows the DNA sequences of the 
seven genes and the amino acid sequences (of the deduced 
10 open reading frames) underneath. 

The seven Tcp-l related cDNAs encode CCT subunits 

Internal tryptic peptides derived from an SDS-PAGE 
gel separation of mouse testis CCT (B1-B4, Figure la) 
were sequenced because the NH^ termini of every subunit 
15 appeared to be blocked yielding no sequence data. 
Frydman et al. [17] obtained some internal peptide 
sequences from four of the bovine testis TRiC subunits 
(PI, P3, P4a/TCP-l and P5) and they also made available 
to us unpublished peptide sequences which we show here 
20 correspond to TRiC subunit P2 . Comparisons between these 
mouse and bovine peptide sequences and the predicted 
peptide sequences of the seven Tcp-l related cDNAs 
reveals that they each encode particular CCT subxmit 
proteins (Figure 3, Figure 8 (h) , Table 1). We named 
25 these 7 new genes as Cctb (pTfi2) , Cctg (pT77) , Cctd 

(PT62) , Ccte (pTe5), Cctr (pTfl2) , Ccth (pCBLSO) and Cctg 
(pT^Dand their encoded proteins as CCT S, y, 6, f, jj , 
and e subunits respectively. We suggest renaming the 
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Tcp-1 gene [20] as Ccta and the TCP-l protein as CCTa. 
Table 1 summarizes the correspondence between the 
proteins encoded by Cat genes and the spots of CCT 
subunits separated by 2D-gel electrophoresis (Figure 1) . 
5 Similarities among CCT suhunit polypeptides 

Table 1 also shows the lengths in amino- acid number 
and molecular weights of the CCT subunits. They have 
similar characteristics despite the fact that they are 
only around 3 0% identical to one another in amino-acid 
sequence (Table 2). They vary slightly in length from 
531-556 residues and in predicted molecular weight from 
57456-60636 Da. These predicted molecular weights are 
approximately consistent with those determined by SDS- 
PAGE (Figure 1) . 

The pis of the subunits, experimentally determined by 
isoelectric focusing, correlate well with the total 
charge value of each predicted CCT subunit polypeptide 
assuming some histidine residues are charged as cations 
at their respective pis (data not shown) . The percentage 
20 of hydrophobic and charged amino acids in each CCT 

subunit is highly conserved, ranging from 31.6-33.5% and 
23.9-27.3% respectively. 

These conserved chemical properties probably reflect 
common functions of each mammalian subunit and are shared 
25 with the archaebacterial chaperonin TF55. Agard [24] 
suggested that hydrophobic interaction between the E. 
coli chaperonin GroEL and its substrates could be 
important for the folding process. The conserved 
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percentage of hydrophobic amino acids in CCT subunits may 
thus be important for interaction with substrates. 
Hydrophobicity profiles of CCT subunits suggest conserv- 
ation of hydrophobic and hydrophilic amino -acid 
5 distributions among CCT subunits, especially at the NHj 
terminus (data not shovm) . 

Figure 4 shows an alignment of all eight CCT 
sequences and TF55. The alignment of the peptide 
sequences show six major gaps (Nos. 164, 201, 246, 315, 
392 and 508 in Figure 4) and thus indicates seven blocks 
of homology divided by the gaps. Amongst these seven 
blocks, the first (Nos. 1-164) and fifth (Nos. 392-508) 
are the largest and contain some highly conserved motifs : 
LGPKGMDKM (Nos. 52-60) (SEQ.' ID NO. 66), TITNDGATIL (Nos. 
15 71-80) (SEQ. ID NO. 67) , QDDEVGDGTTSW (Nos . 100-112 ) 
(SEQ. ID NO. 68), ERSLHDAL (Nos . 423-430) (SEQ. ID NO. 
69), and W(A/P)GGGA (Nos. 442-448) (SEQ. ID NO. 70). 
The third motif includes a motif absolutely conserved in 
CCT subunits, GDGTT (SEQ. ID NO. 71), previously 
recognized by Lewis et al . [14] to be homologous to a 
nucleotide phosphate binding domain of cyclic -AMP- 
dependent kinase and other members of this kinase family 
[25,26]. Figure 5 shows a comparison between these 
motifs. This suggests that all the CCT subunits share 
25 the common function of ATPase activity. 

Structural genes encoding CCT subunits and homologous 
genes in yeast 

Southern analysis of mouse genomic DNA (Figure 6a) 
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Shows the independent structural genes encoding seven 
mouse CCT subunits . Hybridization of Saccharomyces 
cerevisiae DNA with seven mouse Cct cDNA probes (Figure 
6b) shows one or two bands of Pst I -digested DNA for each 
5 probe. Two Tcp-1 related genes have already been 

identified in yeast; the orthologue of Ccta/Tcp-1 [27] 
and the orthologue of Cctb [28] . Both these genes are 
essential [27,28] and temperature sensitive mutations in 
both genes affect microtubule mediated processes [28] . 
The CCT of S. cerevisiae is composed of numerous subunits 
and the yeast homologues of CCTa/TCP-1 and CCTS are 
components of the same complex since they co -purify 
together [28] . Preliminary sequence data derived from 
yeast DNA clones corresponding to the novel sequences 
15 detected with the mouse Cct probes suggests that they are 
indeed the orthologues of the mouse Cct genes. Southern 
analysis of Arabidopsis genomic DNA with the seven Cct 
cDNA probes also suggests the existence of plant 
homologues for each siibunit (data not shown) ; the 
20 Arabidopsis homologue of Ccta/Tcp-l has been reported 
[29] . It is likely that these seven species of CCT 
subunits are ubiquitous in all eukaryotes. 
Determination of the suhtmit compositions and 
arrangements in CCT 
25 The sequence of the eight CCT subunits reveals that 

they are rather divergent at the C-terminii. A 
monoclonal antibody, 23C, had previously been made (21) 
which reacts with the C- terminus of CCTalpha (Harrison- 
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Lavoie et al, 1993, EMBO J. 7: 2.847-2853) and can be used 
to precipitate CCT complexes. Subxmit specific 
antibodies have now been made, utilising the sequence 
information provided by the invention. The antibodies 
5 were raised against C-terminus peptides with the 
following sequences : 

CCTbeta: APRKRVPDHHPC (SEQ. ID NO. 1) 

CCTgamma: NRQTGAPDAGQE (SEQ. ID NO. 2) 

CCTdelta: SILKIDDWNTR (SEQ. ID NO. 3) 

10 CCTepsilon: IDDIRKPGESEE (SEQ. ID NO. 4) 

CCTeta: SAGRGRGQARFH (SEQ. ID NO. 5) 

CCTtheta: SGKKDWDDDQND (SEQ. ID NO. 6) 

CCTzeta: EIMRAGMSSLKG (SEQ. ID NO. 7) 

The peptides were synthesised by conventional solid phase 
15 chemistry on a commercial peptide synthesizing machine. 
An amino terminal cysteine was added to the peptides and 
used to couple them to purified protein derivative (PPD) 
solid supports. 10 milligrams of purified peptide was 
coupled to 9 milligrams of PPD and the conjugate was 
20 purified twice by column chromatography to remove 

unconjugated impurities. Rabbits were immunised over a 
96 day period. Antibodies were screened and isolated 
using their binding for the peptides, using routine 
methods (reviewed in Harlow, E and Lane, D (1988) , 
25 Antibodies: a laboratory manual . CSH Press, NY.). 

Antibodies specific for individual subunits were 
obtained. The antibodies proved to be specific for both 
mouse and human subunits . 
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Immunoprecipitation performed under native conditions 
(50mM Hepes pH8 , 90mM KCl, 0.5% TX-lOO) with these 
subunit specific antibodies shows that CCT consists of a 
complex mixture of particles. Different antibodies 
5 immunoprecipitate different combinations of subunits (for 
instance, see Figure 9) . Examination of different 
immunoprecipitations allows the determination of the 
composition of CCT in any cell type. This information 
enables the intelligent selection of subunit combinations 
to be used in in vitro construction of CCT, for instance 
by means of recombinant expression of subunits and their 
assembly. 

Antibody charactez-isation 

2D-PAGE Mouse testis CCT was purified by sucro'se 
15 gradient fractionation followed by ATP-affinity column 

chromatography as previously described [14] . Isoelectric 
focusing (lEF) was carried out according to [45] ; and was 
followed by SDS-PAGE on 8% gels and either silver 
staining or electrotransfer of proteins to nitro- 
20 cellulose, immunoblotting and detection by the ECL system 
(Amersham) as previously described [14] . 
Non-denaturing isoelectric focusing 

Mouse testis CCT was partially purified by sucrose 
gradient fractionation according to [14] . The fraction 
25 corresponding to 20% sucrose was mixed with an equal 

volume of sample buffer (40% sucrose w/v, 2% Ampholytes 
[Resolyte 4-8, BDH] ) and subjected to non-denaturing 
• isoelectric focusing at 500V for 4 hours. The non- 
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denaturing lEF was performed in the same way as 
denaturing lEF described by [45] with the substitution of 
40% sucrose for 50% urea and the omission of 3-[(3- 
cholamidopropyl) dimethylammonio] -1-propanesulfonate 
5 (CHAPS) in the acrylamide gel mixture. Non- denaturing 

lEF was followed by SDS-PAGE on 8% gels and either silver 
staining or electrotransf er of proteins to nitro- 
cellulose, immunoblotting and detection by the ECL system 
(Amersham) . 

As shown in Figure 10, mouse testis CCT contains nine 
subunit species (Sl-S() by 2D-PAGE analysis (Fig. lOA and 
Table 3), Tcp-1 and seven Tcp-1 related genes encoding 
the or, 0, y, 6, e, f, 77 and 6 subunits of mouse CCT have 
been cloned [52] ; the testis-expressed subunit, S6, and a 
15 co-purifying 63kDa protein (arrowed in Fig. lOA) which 
may be a further subunit of mouse CCT remain to be 
cloned. We have obtained DNA sequences of S6 and S7 and 
have established that they are encoded by two closely 
related CCTf genes . We expect that S6 and S7 can 
20 substitute for each other in CCT and may confer some 
tissue specificity in function of CCT. 

A previously characterised monoclonal antibody, 9 la 
[41,52], was used to detect CCTor (Fig. lOD) . Six 
antibodies recognising single subunit species, BC-l, GC- 
25 1, DC-1, EC-1, TC-1 and THC-2 were produced, which 
reacted specifically with the /3, y. 6, e, -q and 6 
subunits of CCT respectively (Fig. lOB, IOC, lOE, lOF, 
lOH and 101) . In addition to CCTf the antibody ZC-1 also 
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reacted with the testis-expressed subunit S6 (Fig. lOG) , 
which indicates that these two sxibunits S6 and S7, have 
related carboxy terminal sequences and this supports the 
DNA sequence data that they are encoded by closly related 
5 genes. A polyclonal rabbit antibody, UM-i, was made to 
an amino terminal consensus motif which is highly 
conserved between all chaperonin sequences and is 
proposed to be involved in ATP binding and hydrolysis 
(Table 3) [52,53] . UM-i reacted with all nine subunits 
10 of mouse testis CCT including the testis-expressed 
subunit S6. CCT/S, CCTe, CCT7, CCTf and S6 were 
recognised strongly (Fig. 10 J) , which CCTa, CCT6, CCTrj 
and CCTe were recognised weakly (not shown in Fig. 10 j 
but seen on a longer exposure) . In addition to the nine 
15 subunits described above UM-l strongly reacted with the 
co-purifying 63kDa protein, which suggests that this 
protein could be an additional subunit species of mouse 
testis CCT or alternatively a modified isoform of a 
previously identified CCT subunit. 
2 0 Resolution of native populations of CCT 

Partially purified CCT from mouse testis germ cells 
resolves into two distinct populations, I and II, when 
subjected to non- denaturing isoelectric focusing 
(Fig. 11). These populations were further analysed in a 
25 second dimension by SDS-PAGE and visualised by silver 
staining (Fig.llA) or immunoblotting with specific 
antibodies (Fig. IIB and lie) . A striking difference 
between the two populations is that a large number of 
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polypeptides, in addition to subunits of the CCT complex, 
are associated with II whilst I has only a few co- 
migrating species (Fig. iiA) . It is likely that many of 
these associated polypeptides are substrates for folding 
5 by CCT and that II has a high affinity for binding 

polypeptide chain substrates and I has a low affinity. 
To date, only tubulin and actin have been established as 
substrates for folding by CCT in vivo [15,18,16] and both 
^-tubulin (Fig. iic) and actin (data not shown) are found 
10 associated with II but not with I. 

Arrowheads show hsp 70 proteins which resolve away 
from CCT (Pig. ha) , possibly due to the high pH 
conditions during sample loading. Small amounts of hsp 
70 proteins co-immunoprecipitate with CCT under native 
conditions [14,52] suggesting that hsp 70 is found 
associated due to its interaction with polypeptide chains 
undergoing folding on CCT. Therefore, this technique is 
a stringent analysis of protein interactions and a 
reflection of the high affinity of CCT form II for 
20 substrates. 

Fig. iiB shows CCT populations I and II immunoblotted 
with a polyclonal rabbit antibody to CCTe, however both 
populations contain all nine subunits of mouse testis CCT 
as demonstrated by immunoblotting with the antibodies 
25 described above (data not shown) . Since resolution of 
native lEP markers in this system demonstrates that 
separation is on the basis of charge and not size (data 
not shown) , we propose that I and II represent two 
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distinct conformations of hetero-oliogmeric CCT which are 
resolved under these .conditions due to different 
complements of surface charges being exposed in each 
conformation . 

5 During the biochemical extraction of CCT from mouse 

testis and from other sources, we often observe limited 
proteolysis of all the subunits in preparations of intact 
900kDa CCT complex. A 30kDa fragment of CCTe (arrowed in 
Fig. IIB) is detected only in form II. This differential 
susceptibility to proteolysis indicates that the cleavage 
site is more accessible in li, which is supporting 
evidence that I and II adopt different conformations. 
The size of these proteolytic fragments are between 28 to 
30 kDa suggesting that the protease sensitive sites 
reside in the putative apical polypeptide chain binding 
region of each CCT subunit [53] . 
Discussion of antibodies 

Antibodies specific for each subunit species will be 
useful in investigating the combination and arrangement 
of subunits in the CCT complex and the antibody. UM-1, 
which recognises all the subunits can be used as a 
general reagent to identify CCT from other eukaryotes. 
Characterisation of mouse testis CCT revealed that the 
testis-expressed subunit, S6 [52], is related to S7 and 
that a co-purifying 63kDa protein may be a novel subunit 
of mouse testis CCT. The 63 kDa protein is probably 
encoded by a novel Tcp-l-related gene. 

A problem with the biochemical analysis of large 
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molecular weight complexes such as CCT is in resolving 
different states even when these states differ by the 
addition of accessory polypeptides, such as substrates or 
co-factors. Herein described a non- denaturing 
5 isoelectric focusing technique that facilitates the 
resolution of two forms of CCT. One form is bound to 
many other polypeptides and is susceptible to proteolysis 
whilst the other form is bound to only a few other 
polypeptides and is resistant to proteolysis. 

These results suggest that CCT in the cell may adopt 
two distinct conformations with different affinity for 
polypeptide chain substrates. The conformation with high 
binding affinity for substrates may expose a domain which 
is susceptible to proteolysis in each CCT subunit , and 
15 this site may be located within the substrate binding 

domain since it is in a similar position to the substrate 
binding domain of GroEL [54,55]. In GroEL, dramatic 
structural changes have been associated with ATP binding 
[5,56] . One interpretation of our data is that binding 
2 0 or hydrolysis of ATP acts as a switch between 

conformational forms of CCT that interact strongly or 
weakly with polypeptide chain substrates. The wide range 
of polypeptides found associated with CCT suggests that 
the physiological substrates for CCT may not be limited 
25 to act in and tubulin and implies that CCT has a general 
role in protein folding in the eukaryotic cytosol . 
Production and assembly of polypeptide components of a 
folding coaiplex 
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(1) In order to create a complex able to fold a 
polypeptide, messenger RNA dependent rabbit reticulocyte 
lysates are used. Synthetic, capped mRNAs encoding each 
of the CCT sxibunits to associate are independently 
5 synthesised in vitro by RNA transcription- of each 

linearised Bluescript™ plasmid containing a full-length 
coding CCT sequence under the appropriate control 
sequences. (Sambrook et al, 1989, Molecular Cloning: a. 
Laboratory Manual, CSH Press, New York) These RNAs are 
10 added to rabbit reticulocyte lysate containing ^^s- 

methionine for synthesis of ^=S-labelled CCT subunits, 
which facilitates detection on native electrophoretic 
gels or sucrose density gradients. The presence of 
particular subunits is confirmed also using subunit- 
15 specific antibodies, by immunoprecipitation. 

Synthesis of subunits together leads to the assembly 
of a complex whose ability to bind to and/or fold a 
polypeptide can be tested. 

Translating mRNA encoding a substrate polypeptide to 
20 be folded, such as actin or tubulin, with the subunits 
enables detection of complexes of labelled CCT and 
substrate. Labelled substrate facilitates this. 
Although rabbit reticulocyte extracts contain endogenous 
CCT, this does not cycle with the newly synthesised CCT 
25 subunits . 

(2) The rabbit reticulocyte system of (i) is used to 
investigate different combinations of subunits to assess 
their ability to associate and form a complex able to 
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bind and/or fold a polypeptide of interest. For example, 
an assembled complex of translation products of mRNAs 
encoding eight subunits (si, S2, S3, S4, S5, S6 or S7, S8 
and S9) is precipitated using an antibody specific for 
5 any one of the subunits (Figures 13 and 14) and its 

binding to a substrate also translated in vitro, such as 

tubulin, is testable (16) .• 

(3) Subunits are expressed in E. coli using appropriate 
vectors. The combination of subunits chosen for a 
particular purpose, ie folding of a particular 
polypeptide, is made in large quantities. The subunits 
associate to form heteromeric structures of expected 
sedimentation properties and molecular weight. Subunits 
may be expressed individually and then purified, with the 
assembly carried out in Vitro by mixing CCT subunits in 
the appropriate combinations. 

(4) For large scale production, subunits are expressed 
using Baculovirus. A nucleotide sequence encoding a 
subunit is inserted into a suitable expression vector and 
operably linked to control elements within that vector. 
The vector -and wild-type viral genome are transfected 
into an insect host cell where the vector and viral 
genome are allowed to recombine to form a recombinant 
baculovirus expression vector. The baculovirus vector is 
packaged into infectious recombinant baculoviruses . Large 
quantities of subunits are produced which are able to 
associate to form complexes able to fold a polypeptide 
substrate . 



BNSOOCID: <WO 9520654A1> 



10 



15 



20 



25 



WO 9S/206S4 PCT/GB9S/00192 

45 

(5) Folding of a polypeptide substrate, for instance 
tubulin, actin or lucif erase,' is tested by a standard 
technique as described in Frydman et al (17) . See also 
references (4) , (is) and. (16) . 
5 Assembly of mouse CCT from 8 recombinant cDNA clones. 

The purpose of these experiments is to show that the 
8 CDNA clones encoding the CCTa, /S, y, 6, e, f, ,7 and 5 
subunits are necessary and sufficient to produce 
recombinant CCT proteins capable of co-assembling into 
the heteromeric core-CCT complex which we expect from our 
biochemical and genetic analysis to be constructed from 
these 8 subunits. We have demonstrated this in the 
following way. Plasmids containing T3 or T7 promoters 
upstream of the CCT cDNA open reading frames were 
linearized with appropriate restriction endonucleases to 
produce templates for the synthesis of each CCT mRNA by 
T3 or T7 RNA polymerase . The plasmids which we use are 
CCTa/pT1611, CCT^/pT^2, CCT-y/pTyV, CCT6/pT62, CCTe/pTe5, 
CCTr/pTfl2, CCTT7/pT7;29 and CCT5/pT&l. 

When rabbit reticulocyte lysate was programmed singly 
with each CCT subunit encoding cDNA a 35S-labelled 
polypeptide of expected molecular weight was synthesized 
(Figure 12) . 

To show assembly, the lysate was programmed with all 
eight subunit encoding plasmids together. After 
synthesis the reaction mix was applied to a 10-40% 
sucrose gradient in order to resolve the assembled CCT 
complexes from the free subunits as shown previously by 
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US {Reference 14). Labelled CCT complex can be detected 
at 20% sucrose density (Figure 13A) and all the subunits 
are immuno-precipitated using antibodies specific to 
single subunits (Figure 13B) . This demonstrates the 
5 principle of co- assembly from the 8 subunit genes. This 
process is repeatable using . subunit proteins expressed in 
mammalian tissue culture cells, bacteria and yeasts. 
Yeast expression 

The Piachia pastorls heterologous gene expression 
10 system has been used to express individual CCT subunits 

in yeast. Specifically we inserted the CCTe open reading 
frame into pHIL-D2. This construct was used to transform 
Pischia producing strains expressing CCTe under the 
control of the AOXl promoter. These strains, such as 
15 WIL5, were shown to produce CCTe in response to induction 
by methanol, both using small scale cultures (to measure 
time course of induction) and subsequently by large-scale 
fermentation. In order to follow expression of mouse 
CCTe in Pischia pastoris in these experiments anti-CCTe 
20 antibodies disclosed herein were utilised. Without the 
antibodies the experiments would not have been possible. 
All procedures were according to the In Vitrogen manual 
and as described in Cregg, J.M. et al . , 1993 
Bio/Technology, Vol. ll, 905-910. 
Expression in human cells 

CCT subunits were expressed in human cells as free 
subunits and incorporated into endogenous CCT complex 
using standard eukaryotic expression vectors such as 
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pcDNAl. Specifically, the CCTa and CCT/3 open reading 
frames were subcloned into pcDNAl giving plasmids pEXa 
and pEX^. These plasmids were transfected into human 
293T cells and CCT protein expression was measured by 
5 Western blotting and immunoprecipitation using antibodies 
disclosed herein. Substantial expression of CCTa and 
CCT^ was obtained. 
Additional Discussion 

The symmetry of each ring of the CCT double -torus 
) particle is yet to be determined, although EM-negative 
stain analysis suggests that it is likely to comprise 8 
or 9 subunits [14] . Although the stoichiometric 
composition of CCT with respect to individual polypeptide 
species has not yet been determined biochemically, we 
have confirmed that there are seven species of CCT 
subunits by 2D-NEPHGE analysis of antibody-affinity 
purified CCT purified from mouse F9 cells. We have been 
able to assign eight CCT subunit proteins to eight Cct 
genes including Tcp-l/Ccta. The analysis of CCT [14] and 
the act in chaperonin [15] by negative stain electron 
microscopy is consistent with the heteromeric nature of 
the CCT since the particles appear quasi -symmetrical with 
different sized/shaped subunits. Lewis et al . [14] 
previously showed that mouse testis CCT contained more 
subunits than human HEp2 cell CCT and Roobol and Carden 
[19] found differences between brain CCT and testis CCT 
derived from rats and guinea-pigs. These data are 
consistent with the idea that a particular CCT particle 
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is composed of several kinds of subunits, although some 
subunits may be replaceable depending upon tissue type. 

The eight genes (the seven new ones plus the original 
TCP-i (alpha) ) described herein are sufficient to 
5 reconstitute CCT. Certain cell types such as mouse F9 
and human HEp-2 contain abundant levels of 7 cCT 
subunits, with the S2 subunit, encoded by the CCTe gene, 
appearing to be absent, based on protein analysis but it 
may be rapidly turning over or post-translationally 
modified causing levels of S2 to be difficult to measure. 
The role of the mouse testis expressed subunit, S6, is 
unclear but it probably has an exchangeable function with 
S7, the product of the CCTzeta gene, given that 
antibodies raised against S7 react with S6 and that we 
15 have DNA clones for two closely related CCTf genes. 
Northern blotting and protein analysis lead to the 
conclusion that SI, S2, S3, S4 , S5, S7, S8 and S9 are 
ubiquitously expressed and therefore that they constitute 
the eight essential components of general CCT activity. 
2 0 One can build/reconstruct CCT folding machines 

(complexes) using combinations of these seven components. 
Specialised proteins may need the presence of an 
additional, eg tissue specific, subunit or other 
cof actors to be correctly folded. However, these should 
25 be able to be incorporated into CCT machines in 
combination with the subunits provided herein. 
Evolutionary origin of CCT subunit genes 

All organisms are classifiable into three primary 
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kingdoms, eubacteria, archaebacteria and eukaryotes, 
based upon ribosomal RNA sequences [3 0] . Several 
proteins found in archaebacteria and eukaryotic cytosol 
are similar to each other and those of mitochondria, 
5 chloroplasts and eubacteria are also similar to each 
other [31] . 

Our sequence analysis has shown that each CCT subunit 
is very similar to TF55 of an archaebacterium Sulfolobus 
shibatae (Figure 4, Table 2) and only weakly related to 
GroEL and other chaperonin proteins which are thought to 
have evolved from a common eubacterial origin [10] . 
Another archaebacterium, Pyrodictivm occul turn, also has 
an ATPase complex whose tryptic peptides share 70% 
identity with TPS 5 [32]. These results suggest that all 
the CCT genes evolved from a common ancestor of 
eukaryotes and archaebacteria. Each eukaryotic Cct gene 
is as divergent from another Cct gene as it is from TF55 
and this suggests that the eukaryotic genes diverged from 
one another very early in the eukaryotic lineage. An 
evolutionary tree of CCT subunits and TF55 based on 
amino-acid substitutions supports these ideas (Figure 7) . 
If the amino-acid substitution rate of each CCT subunit 
is constant during evolution [33] and assuming that 
yeasts and animals diverged 1000-1200 million years ago, 
25 then we calculate 1800-2400 million years for the 
divergence times of CCT subunits. 

However, the orthologues of each CCT member in 
animals, plants and yeasts are much more similar to one 
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another and this suggests that each subunit has evolved 
an independent function a long time ago which has been 
maintained in most eukaryotes . Since CCT is composed of 
7-9 subunit species and other chaperonins have only one 
5 or two subunit species, it seems likely that the CCT has 
evolved more complex functions in eukaryotic cytosol. 
The increased complexity of CCT may have facilitated the 
evolution of the highly organized eukaryotic cytosol by 
co-evolution between CCT and substrates. 
10 Function of the CCT 

CCT, TF55 and classical chaperonins (GroEL, Hsp60 and 
Rubisco binding protein) all have ATPase activity. It 
seems likely that the conservation of sequence in this 
family reflects the maintenance of ATPase activity rather 
15 than the ability to form double torus structures which, 
many unrelated proteins, such as glutamine synthetase 
[34] can adopt. In GroEL the conformation of the 
particle changes by rotation of the GroEL subunits 
relative to the perpendicular axis upon binding ATP [5] . 
20 EM analysis of the actin chaperonin in the presence and 

absence of ATP suggests that similar large conformational 
changes may occur [15] . There are three highly conserved 
motifs among the TCP-1, TF55 and GroEL family chaperonins 
(see Figure 4 in [14]). Two motifs, 
25 {I/V)T(N/K)DG(A/V) (T/S) (SEQ. ID NO. 72) and GDGTT{S/T) 
(SEQ. ID NO. 73), are positioned at the NHj terminus and 
one, V(A/P)GGG (SEQ. ID NO. 74), towards the COOH 
terminus. These three motifs are also conserved in the 
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other CCT subunits and interestingly the amino-acid 
sequences around GDGTT(T/S) (Figure 5) have homologies to 
the phosphate binding domains for ATP hydrolysis shared 
by the cAMP- dependent kinase family proteins [25,26]. if 
5 these widely separated motifs are indeed involved in ATP 
binding and hydrolysis, then they may be responsible for 
the large conformational changes the chaperonins make 
upon ATP binding. This idea is supported by our 
knowledge of the structure of cAMP- dependent protein 
10 kinase-a [26], Hsp70 [35,36] and actin/myosin [37]; 

several domains required for ATPase activity are widely 
separated in the amino-acid chains of these proteins. 

One possible reason for the multiplicity of CCT 
subunit species may be the contributions of each subunit 
15 to interaction with other co-factors involved in protein 
folding processes. We previously showed that CCT binds 
Hsp70 species [14]. In E. coll, DnaK {Hsp70 family) and 
GroE (Hsp60 family) are thought to cooperate with DnaJ in 
the folding of newly synthesized proteins [38,39] . it is 
20 possible that a similar mechanism involving Hsp70, DnaJ 

homologues [40] and CCT occurs in eukaryotic cytosol . We 
found a weak homology between the COOH terminus of CCT6 
and Hsp70 family proteins (not shown) and this may 
suggest a role for CCT6 in interactions between 
25 chaperones and co-factors. 

It is known that some chaperonins (GroE, Hsp60) are 
not able to fold substrates which can be folded by CCT; 
mitochondrial Hsp60 does not fold or form a binary 
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complex with denatured a-actin [15] and GroEL/ES does not 
fold luciferase although it can bind it [17] , This 
suggests that the substrate specificity of classical 
chaperonins and CCT differs and that CCT might have 
5 changed its spectrum of substrate building by evolving a 
far larger number of subunit species than the single 
siibunit species chaperonins. Stemlicht et al. [18] 
reported that a 40 kDa newly synthesized protein co- 
purified with CCT in addition to actin and tubulin. This 

10 would seem to be a novel substrate of CCT, The general 
function of CCT, in addition to its folding of actin and 
tubulin, is illustrated by a genetic experiment in S. 
cerevisiae. Expression of human HTR3 cDNA which encodes 
a partial TCP-1 related polypeptide rescued a mutation in 

15 amino-acid transport [41] . The HTR3 polypeptide has 96% 
homology to mouse CCTf, and is thus the human orthologue 
of CCTf. This suggests that CCT might assist folding of 
a protein affecting amino-acid transport. CCT binds in 
vitro to chromaffin granule membranes coupled to an 

20 affinity matrix [42] . This suggests a role for CCT for. 
the regulation of secretory vesicles and membrane 
trafficking events. A short peptide sequence of the TCP- 
1- related chaperonin of oats [42] which seems to have 
evolved the specialized function of folding phytochrome, 

25 is more related to CCT6 (Residue nos . 369-384 in Figure 
4) than the other CCT sxibunits . Specialized functions 
might have been derived from different primordial Cct 
genes in different ancestral organisms. 
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CCT is believed to act mainly as folding machinery 
for tubulin and act in by many investigators despite the 
fact that these proteins show no significant amino-acid 
sequence homology. Act in and tubulin are extremely 
5 abundant proteins in the cell, and thus they were 
probably the easiest proteins to be detected as 
substrates [18] . 

Table 4 summarises proteins known to be folded by 
CCT. Firefly lucif erase has been reported to be folded 
10 by CCT in vitro (17, 61) and brain CCT was purified as- a 
neurofilament fragment binding protein [19] . Lingappa et 
al. [62] reported that TCP-1 and/or a TCP-l-like 
chaperonin is involved in capsid assembly by hepatitis B 
virus in vitro. Thus, CCT may assist in the folding of 
not only actin and tubulin but also a more wider range of 
proteins in vivo. 

Functional analysis of each subunit species will 
provide further information of the role of CCT in vivo in 
general protein folding events in eukaryotic cells. 

We have identified nine subunit species of CCT in 
mammalian testis and seven major species in mouse F9 
cells and we cloned seven Tcp-l related genes encoding 
CCT subunits found in common to testis and F9 cell CCT. 
Tcp-l/Ccta and these genes are so highly conserved from 
mammals to yeast they can be detected by cross -species 
Southern blotting analysis. The amino-acid sequences of 
the mouse and yeast orthologues of two CCT subunit 
species (CCTot and CCTS) show 61% and 66% sequence 



15 



20 



25 
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identity, respectively. All CCT subunits contain a motif 
similar to an ATP binding domain of cAMP -dependent 
protein kinase and other kinases of this family, 
suggesting all CCT subunits have independent ATPase 
5 activities. The mouse CCT siibunits show only 26%-35% 

homology between one svibunit protein species and another 
and phylogenetic analysis suggests that the divergence 
time of CCT siibunit genes is approximately two times as 
long as the divergence time of animals and yeasts. 
10 These observations suggest that each subunit has 

evolved a specific, independent function in addition to 
the common ATPase function a long time ago and that these 
functions are maintained in all eukaryotes. The 
expansion of the number of subunit species compared to 
other chaperonins may have allowed more complex functions 
required for folding and assembly of highly evolved 
eukaryotic proteins. 
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Table 1. Characteristics of CCT subunits 



10 



15 



20 



Subunit 
species 



Spot on 2D 
gels 



CCTor 
CCT/S 
CCTy 
CCT6 
CCTe 
CCTf 
CCT77 
CCT0 



S3 
S4 
S5 
S9 
S2 
S7 
SB 
SI 



Number of 
amino acids 



556 
536 
545 
539 
541 
531 
541 
548 



Molecular 
weight 



60513 
57456 
60636 
58073 
59631 
58011 
59658 
59562 



The spot numbers of CCT subunits on 2D gels (S1-S9) are 
described in Figure 1. The correspondences of CCT 
subunit Of to spot S3 tl4] , e to S2, f to S7 and e to SI 
(Figure 10) were determined by Western blotting of 2D 
gels with antibodies which react specifically with each 
subunit. The other correspondences are derived from the 
peptide sequencing data described in Figure 3 and the 
Western blotting data in Figure 10. Numbers of amino 
acids and molecular weights of CCT subunits are 
calculated from the amino-acid sequences in Figure 3 and 
Figure 8 . 
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10 



Table 2. Homology among mouse CCT subunits and 
archaebacterial chaperone TF55 . 





CCTof 


CCTfi 


CCT-y 


CCT6 


CCTe 


CCTf 


CCTt? 


CCT^ 


TP55 


CCTOf 
CCTp 
CCT7 
CCT6 
CCTe 
CCTf 
CCT77 
CCT^ 
TF55 


35.4 
32.5 
32.1 
31.9 
27.1 
34.6 
27.5 
38.7 


7 (24) 

28.4 
28.2 
31.9 
25.0 
31.9 
25.4 
32.0 


3 (13) 
4(8) 

33.1 
32.2 
29.2 
34 .6 
27.4 
34.1 


4(12) 
4 (10) 
5(10) 

36.2 
26.5 
28.5 
27.8 
37.1 


3 (12) 
5(9) 
2 (2) 
4(9) 

31.0 
28.4 
29.1 
38 . 0 


6 (24) 
5(14) 
5 (16) 
5(10) 
5(17) 

26.3 
22.5 
35.1 


5(21) 

4(8) 

4(8) 

3 (4) 

4(6) 

4(8) 

24.0 
34.9 


5(19) 

6(8) 

5(11) 

5(11) 

4(6) 

3(12) 

4 (8) 

5 (10) 


4 (16) 

618) 

2(2) 

5(10) 

3(6) 

4 (13) 

3(6) 

29.0 



15 



Percentages of identical amino acids are shown below the 
diagonal and numbers of gaps/insertions are above the 
diagonal. Numbers in parentheses are total lengths of 
gaps /insert ions given in amino acid numbers. 



20 



Table 3. 



25 



30 



35 



40 



Immunochemical analysis of cCT subunits 




of"^c?r°^|f monoclonal antibody which recognises the C-terminus 

P63** denotes a co-purifying 63kDa protein of pi 6.93. 
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Table 4 Proteins known to be folded or assembled by CCT 
and CO- factors. 



Protein 


Experiment 


Reference 


actin 

actin-RVP 

tiibulin 

nexirofilament (fragment) 
luciferase 

hepatitis B virus caps id 


in vitro, in vivo 
in vitro 

in vitro, in vivo 
in vitro 
in vitro 
in vitro* 


15,18 
57 

16,17,18,57,58,59,60 
19 

17,61 
62 



♦ TCP-1 or TCP-l-like chaperonin recognized by anti-TCP-i antibody. 



*3NSDOCID: <WO 9520654A1> 



wo 95/20654 PCT/GB95/00192 

61 

CLAIMS 

1. A DNA isolate comprising a nucleotide sequence 
encoding a subunit of a polypeptide folding complex, the 
5 subunit having an amino acid sequence shown in any of 
SEQ. ID NO.'s 25, 27, 28, 29, 30 and 31 or an amino acid 
sequence having a significant degree of homology to a 
shown sequence . 

10 2. A DNA isolate according to claim 1 wherein the 

nucleotide sequence is any of SEQ. ID NO.'s 55, 57, 58, 
59, 60 and 61. 

3. A DNA isolate according to claim 1 wherein the 
15 nucleotide sequence is an allele, a mutant or 

derivative, by way of nucleotide deletion, substitution 
or insertion, of any of the nucleotide sequences shown 
in SEQ. ID NO.'s 55, 57, S8, 59, 60 and 61. 

20 4. A DNA isolate comprising a nucleotide sequence 

encoding a subunit of a polypeptide folding complex, the 
subunit comprising an amino acid sequence which is a 
mutant or derivative, by way of amino acid deletion, 
substitution or insertion-, of any of the amino acid 

25 sequences shown in SEQ. ID NO.'s 25, 27, 28, 29, 30 and 



31. 



A vector comprising DNA according to any one of 
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claims 1 to 4 and nucleic acid for expression of the 
encoded subunit , 

6. A host cell comprising a vector according to claim 
5 5. 

7, A host cell according to claim 6 which is 
prokaryotic . 

10 8. A host cell according to claim 6 which is 
eukai^otic - 

S. A method of making a subunit of a polypeptide 
folding complex comprising^ expression from a vector 
15 according to claim 5. 

10. A method according to claim 9 wherein the 
expression results from culturing a host cell comprising 
the vector. 

20 

11. A method according to claim 10 wherein the host 
cell is prokaryotic. 

12. A method according to claim 10 wherein the host 
2 5 cell is eukaryotic. 

13. A method of producing a polypeptide folding 
complex, the method comprising expression of subunits of 
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the complex from encoding nucleic acid therefor and 
causing or allowing assembly of the subunits into the 
complex. 



5 14. A method according to claim 13 wherein the subunits 
comprise any of the amino acid sequences shown in SEQ. 
ID NO.'s 25, 26. 27, 28, 29, 30 and 31 or an amino acid 
sequence having, a significant degree of homology to a 
shown sequence. 

10 

15. A method according to claim 14 wherein the nucleic 
acid encoding any of the subunits has a nucleotide 
sequence which is an allele, a mutant or derivative, by 
way of nucleotide deletion," substitution or insertion, 

15 of any of the nucleotide sequences shown in SEQ. ID 
NO.'s 55, 56, 57, 58, 59, 60 and 61. 

16. A method according to claim 13 wherein the amino 
acid sequence of any of the subunits is a mutant or 

20 derivative, by way of amino acid deletion, substitution 
or insertion, of any of the amino acid sequences shown 
in SEQ. ID NO.'s 25, 26, 27, 28, 29, 30 and 31. 

17. A method according to any one of claims 13 to 16 
25 wherein assembly of the subunits into the complex is by 

in vitro mixing following expression. 

18. A method according to any one of claims 13 to 16 
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19. A purified polypeptide subunit of a polypeptide 
folding complex comprising an amino acid sequence shown 

5 in any of SEQ. ID NO.'s 25, 27, 28, 29, 30 and 31, or an 
amino acid sequence having a significant degree of 
homolocfy to a shown sequence. 

20. A polypeptide subunit of a polypeptide folding 
10 complex comprising an amino acid sequence which is a 

mutant or derivative, by way of amino acid deletion, 
substitution or insertion, of any of the amino acid 
sequences shown in SEQ. ID NO.'s 25, 27, 28, 29, 30 and 



31. 



15 



21. A polypeptide folding complex comprising 
polypeptide subunits selected from subunits according to 
claim 19 or claim 20, optionally a sxibunit comprising 
the amino acid sequence which is shown in SEQ. ID NO. 26 

20 or an amino acid sequence which is a mutant or 

derivative, by way of amino acid deletion, substitution 
or insertion, of the amino acid sequence shown in SEQ. 
ID NO. 26, or an amino acid sequence having a 
significant degree of homology to the sequence shown in 

25 SEQ. ID NO. 26, and optionally a subunit comprising the 
amino acid sequence which is shown in SEQ. ID NO. 24 or 
an amino acid sequence which is a mutant or derivative, 
by way of amino acid deletion, substitution or 
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insertion, of the amino acid sequence shown in SEQ. ID 
NO. 24, or an amino acid sequence having a significant 
degree of homology to the sequence shown in SEQ. ID NO. 
24 . 

5 

22. Use of a polypeptide folding complex according to 
claim 21 in the folding of a polypeptide. 

23 . A method of folding a polypeptide employing a 
10 polypeptide folding complex according to claim 21. 

24. A method according to claim 23 wherein the folding 
of the polypeptide follows assembly of the polypeptide 
folding complex from subunits thereof, following 

15 production of the subunits by recombinant expression. 

25. An antibody or fragment thereof able to bind 
specifically to a subunit of a polypeptide folding 
complex other than TCP-l (CCTa - SEQ. ID NO. 24) . 



20 



26. An antibody or fragment thereof according to claim 
25 wherein the subunit to which the antibody or fragment 
thereof is able to bind is any of CCT0 (SEQ. ID NO. 25) , 
CCT7 (SEQ. ID NO. 26), CCT6 (SEQ. ID NO. 27), CCTe (SEQ. 

25 ID NO. 28), CCTf (SEQ. ID NO. 29), CCTtj (SEQ. ID NO. 30) 
and CCTd (SEQ. ID NO. 31) . 

27. An antibody or fragment thereof according to claim 
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26 having binding specificity for a peptide which has an 
amino acid sequence shown in any of SEQ. ID NO.' s 1, 2, 
3 , 4 , 5 , 6 and 7 , or which has an amino acid sequence 
which is a variant of any of those shown in SEQ, ID 
5 NO. 'si, 2, 3, 4, 5, 6 and 7. 

28- A method of obtaining an antibody or fragment 
thereof according to claim 25 comprising administering a 
peptide which has an amino acid sequence shown in any of 
10 SEQ. ID NO.'s 1, 2, 3, 4, 5, 6 and 7, or which has an 
amino acid sequence which is a variant of any of those 
shown in SEQ. ID NO.'s 1, 2, 3, 4, 5, 6 and 7, to a 
mammal and recovering antibody from the mammal . 
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Mouse 


Bovine 


Spot 




M.W. (KD) 


Spot 




M.W. (KD) 


SI 


6.25 


62 


P4.1 


6.1 


60 


S2 


6.35 


61 


P4.2 


6.3 


61 


S3 


6.5 


60 


P4.3 


6.45 


60 


S4 


6.65 


53 


PI 


6.8 


52 


SS 


6.85 


65 


P5 


6.85 


55 


S6 


6.9 


62.5 


P4.4 


6.85 


60.5 


S7 


6.95 


62 


P2 


7.0 


56 


S8 


7.1 


57 


P3 


7.1 


58 


S9 


>7.2 


56 

V 


P4.5 


>7.2 


61 
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Fig. 3a. 

MASLSLAPVNIFKAGADEERAEIARLSSFIGAIAIGDLVKSTLGPKGMDK 050 

I L L S S ORDAALMVTNDGAT I LKNI G VDNP AAKVL VDM S R VQDDEVGDGTT 100 

P1(T31) 

SVTVLAAELLREAE S L lAKKlHPQTI I SGWREATKAAREALLS SAVDHOS 150 

DEARFWQDLMNIAGTTLSSKLLTHHKDHFTKLAVEAGLRLKGSGNLEAIH 2 00 

VIKKLGGSLADS YLDEGFLLDKKIGVNQPKRIENAKILIANTGMDTDKIK 2 5 0 

X'^X" B4{R162) 

IFGSRVRVDSTAKVAEIEHAEKEKMKEKVERILKHGINCFINRQLIYNYP 300 

EQLFGAAGVMAlEHADFAGVERI.Ar.VTGGEIASTFDHPEI.VKLGSCKLIE 3 5 0 

EVMlGEDKLlHFSGVALGEACTrVLRGATQQlLDEAERSLHDALCVLAQT 400 

VKDPRTVYGGGC SEMLMAHAVTQIiANRTPGKEAVAME SFAKALRMLPTI I 4 5 0 

ADNAGYDSADLVAQLRAAHSEGHITAGLDMKEGTIGDMAVLGITESFQVK 50 0 
RQVLLSAAEAAEVILRVDNIIKAAPRKRVPDHHPC 53 5 



Fig.3b. 

MMGHRPVLVLSQNTKRESGRKVQSGNINAAKTIADIIRTCLGPKSMMKML 05 0 

LDPMGGI VMTNDGNAl LREI QVQHPAAKSMIEI SRTQDEEVGDGTTS VI I 10 0 

LAGEMLSVAEHFLEQQMHPTWISAYRMALDDMISTLKKISTPVDVNNRE 15 0 

MML S I IMS S ITTKVI SRWSSLACNIALDAVKTVQFEEN6RREIDIKKYAR 20 0 

VEKIPGGI lEDSCVLRGVMINKDVTHPRMRRYIKNPRIVLLDSSIiEYKKQ 2 5 0 

ESQTDIEITREEDFTRI LQMEEEY I HQI.CED I IOLK PPWITEKGISDLA 300 

°° P5{T36) 

QHYLMRANVTAIRRVRKTDNNRIARACGARIVSRPEELREDDVGTGAGLL 3 5 0 

EIKKIGDEYFTFITDCKDPKACTILLRGASKEILSEVERMLQDAMQVCRN 4 0 0 

VH.DPQL VPGGGASEMAVAHALTEK S KAMTGVE QWPYRAVAOALEVTPR'P 45 0 
P5(T32) B1(R133) 

LIQNCGASTIRLLTSr.RAKHTQESCETWGVNGETGTI.VDMKELGIWEPr.A 50 0 
VKLQTYKTAVETAVLLLRIDDIVSGHKKKODDQMRQTGAPDAGQE 545 
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Fig.3c. 

MPENVASRSGAPTAGPGSRGKSAYQDRDKPAQIRFSNISAAKAVADAIRT 0 50 

SLGPKGMDKMIQDGKGDVTITNDGATILKQMQVLHPAARMLVELS KAQDI 10 0 

EAGDGTT S VVII AGSLLDSCTKLIiQKGIHPTIISESFQKAIjEKGLEII.TD 15 0 
X P3(T31) 

MSRPVQLSDRETIiLNSATTSIiNSKWSQYSSLLSPMSVNAVMKVIDPATA 20 0 

TSVDLRDIKIVKKLGGTIDDCELVEGIjVLTQKVANSGITRVEKAKIGLIQ 2 5 0 

FCLSAPKTDMDNQIVVSDYAQMDRVLREERAYILNLVKQIKKTGCNVLLI 30 0 

QKSILRDALSDLALHFLNKMKIMVVKDVEREDIEFICKTIGTKPVAHIDQ 3 50 

FTADMLGSAELAEEVSLNGSGKLFKITGCTSPGKTVTIWRGSNKLVIEE 40 0 

AERSIHDALCVIRCIiVKKRALIAGGGAPEIEIjALRLTEYSRTLSGMESYC 4 50 

V RAFADAMEVIPSTLAENAGLNPISTVT ELRNRHAQGEKTTGINVRKGGI 5 0 0 
P3 (T50) X- 

SNILEEMWQPLLVSVSALTIiATETVRSILKIDDWNTR 5 39 



Fig. 3d. 

MAS VGTLAFDE YGRPFLI IKDQDRKSRLMGLEALKSHIMAAKAVANTMRT 0 5 0 

SliGPNGLDKMMVDKDGDVTITNDGATIIiSMMDVDHQIAKIiMVEIiSKSQDD 10 0 

EIGDGTTGVWLAGALLEEAEQLLDRGIHPIRIADGYEQAARIAIQHLDK 15 0 

ISDKVLVDINNPEPLIQTAKTTLGSKVINSCHRQMAEIAVNAVLTVADME 2 00 

RRDVDFELIKVEGKVGGRLEDTKLIKGVIVDKDPSHPQMPKKVVDAKIAI 2 50 

LTCPFEPPKPKTKHKLDVMSVEDYKALQKYEKEKPEEMIKQIKETGANLA 30 0 

ICQWGFDDEANHLLIiQNGIjPAVRWVGGPEIELIAIATGGRIVPRFSELTS 3 5 0 

EKLGFAGWQEISFGTTKDKMLVIEKCKNSRAVTIFIRGGNKMIIEEAKR 4 00 

SLHDALCVIRNIilRDNRWYGGGAAEISCALAVSQEADKCPTIiEQYAMRA 45 0 

FADAIiEVIPMAIiSENSGMNPIQTMTEVRARQVKESNPAIiGlDCLHKGSND 50 0 
HQYQHVXETLZGKKQQISLATQMVRHZXiKIDDIRKPGESEE 54 1 

SUBSTITUTE SHEET (RULE 26) 
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Fig.Se. 

MAAVKTLNPKAEVARAQAALAVNISAARGLQDVLRTNLGPKGTMKMLVSG 05 0 

AGDIKLTKDGNVLLHEMQIQHPTASLIAKVATAQDDITGDGTTSMVLIIG 10 0 

ELLKQADLYISEGLHPRIITEGFEAAKEKALQFLEQVKVSKEMDRETliID 15 0 

VARTSLRTKVHAELADVIiTEAVVDSIIjAIRKKDEPIDLPMVEIMEMKHKS 200 

ETDTSLIRGLVLDHGARHPDMKKRVENAYILTCNVSLEYEKTEVNSGFFY 250 

KSAEEREKLVKAERKFIEDRVKKI lELKKKVCGDSDKGFWXNQKGIDPF 3 0 0 

B3(R170) 

SLDAIiAKEGIVALRRAKRRMMERLTLACGGIALNSFDDLNPDCLGHAGLV 3 5 0 

YEYTLGEEKFTFIEKCNNPRSVTLLVKGPNKHTLTQIKDAIRDGLRAVKN 4 0 0 

AIDDGCVVPGAGAVEVALAEALIKYKPSVKGRAQLGVQAFADALLIIPKV 4 50 

LAQNSGFDLQETLVKVQAEHSESGQLVGVDLSTGEPMVAAEMGVWDNYCV 50 0 
KKOLLHSCTVIATMIIiLVDEIMRAGMSSLKG 531 



Fig.3f. 

MMPTPVir.LKEGTDSSQGlPQLVSNISACQVIAEAVRTTLGPRGMDKLIV 050 

DGRGKATISNDGATILKLLDWHPAAKTLVDIAK SQDAEVGDGTTSVTLL 100 

P2~ 

AAEFLKQVKPYVEEGLHPQl 1 IRAF RTATQLAV NKIKEIAVTVKKODKVE 15 0 

^ fi2(R115) B2{R124) 

QRKMI,EKCAMTALSSKL1SQQKVFFAKMVVDAVMMLDEL1.QLKM1GIKKV 2 0 0 

QGGALEESQLVAGVAFKKTPSYAGPEMQPKKYKNPKIALLMVELELKAEK 2 5 0 

DMAEIRVHTVEDYQAIVDAEWNILYDKLEKIHQSGAKV1I.SKI.PIGDVAT 300 

QYFADRDMFCAGRVPEEDLKRTMMACGGS IQTSVNALVPDVLGHCQVFEE 3 5 0 

TQIGGERYNFFTGCPKAKTCTIILRGGAEQFMEETERSLHDAIMIVRRAI 4 0 0 

KNDSWAQGGAIEMELSKYLRDYSRTIPGKQQLLIGAYAKALEIIPRQLC 45 0 

DNA6FDATNILNRLRARHAQGGHWYGVDINNENZADNFQAFVWEPAMVRI 50 0 

NALTAASEAACLIVSVDETIKNPRSTVDPPAPSAGRGROQARFH 544 
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F i Q . 5 . CCTa KEVGDGTTSWI lA 

^ CCTP DEVGDGTTSVTVLA. 

CCTy EEVGDGTTSVTILA 

CCT6 lEAGDGTTSWI lA 

CCTe DEIGDGTTGWVIA 

CCTC DITGDGTTSNVLII 

CCTn AEVGDGTTSVTLIA 

CCTe QEVGDGTNFVLVFA 



cAPK-a KTLGTGSFGRVMLV 

PKC-a MVXGKGSFGKVMLA 
CaMII-a EELGKGAFSWRRC 

SNFl KTLGEGSFGICVKLA 

Cdc2+ EKIGEGTYGWYKA 

CDC 7 DKIGEGTFSSVYKA 

Raf TRIGSGSFGTVYKG 

Src VKLGQGCFGEVWMG 

Abl HKLGGGQYGEVYEG 

EGFR KVLGSGAFGTVYKG 

INSR RELGQGSFGMVYEG 

PDGFR RTLGSGAFGQWEA 
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Fig. 8a. 

(Ccfa) 

Tcp' 7 cDNA sequence 

TCTCACyiGCGGAGCTATCGGTTGTCTGCGGCCGTAGTCAGCrTTGCTTCGTTTCCTGAAGATG^ 

MEGPLSVFGD 

CGCAGCACTGGGGAGGCGGTCCGCTCCCAGAATG^ 

R5TGEAVRSQNVMAAASIANIVKSSFGPVG 
TTGGATAAAATGTOGCTGGATGATATTGGTGATGTAAOa^ 

LDKHtVDDIGDVTITHDGATILKLLEVEHP 
GCAGCCAAAGTTCTGTGTGAGCTGGCTCACC^ 

AAKVLCEtADLQDKEVGDGTT SVV I lA AEL 
CTGAAAAATGCAGATGAGCTAGK^^ 

I.KHADELVKQKIHPTSV1SGYRLACKEAVR 
TATATCAATGAGAACCTGATTATTAAOlCAGACGAACirroGAAG^ 

yiNENLIINTDELGRDCLINTAKTSMSSKI 
ATTGGAATAAATGGTGATTACTOGCT^^^ 

IGINGDYFANMVVDAVLAVKYTDARGQPRY 
CCAGTCAATTCTOTAATATTCTGAAAKCC^^ 

rvNSVNILKAHGRSQIESMLINGYALNCVV 
GGATCTCAGGGCATGCCC^GAGAATACTTAATG^^ 

t-BQGKPKRlVNAKlACLDFSLQKTKMKlGV 
CAGCTGGTTATTACAGACCCTGAGAAACT 

WvviTUPEKLDQIRQRESDITKERIQKILA 
ACTGGTGCCAATOTATTCTAACCAC^ 

TGANVILTTGGIDDMYLKYFVEAGAMAVRR 



lCTTTT 
F 



GTTTOAAAACGAGACCTGAAGCATOTG^^ 

vt.KRDI,KHVAKASGASILSTLANI-E.GEET 
ACATCTGCTTCAATCATCCTACGAGGAGO^ 

i&ASIILRGANDFHCDEHERSLHDALCVVK 
AGAOTTTGGAGTTGAAATCTCT^ 

RVLEI,KSVVPGGGAVEAALS1YI,ENYAT NM 
GGATCTCGOGAACACCTTGCTATTG^ 

GSREQLAIAEFARSLLVIPNTLAVKAAQDS 
AOZGACCTGGTTGCCAAGW 

XBI.VAKI.RAFHNEAQVNPERKNLKWIGLDI. 
GTCCATGGGAAACCACGAGAOV^ 

VBeKPRDNKQAGVFEPTIVKVKSLKFATEA 
GCAATCACCATCXTTCGGATTGACGATCT 

AlTl i-RIOOLIKLHPESKDDKHGSYENAVH 
TCTGGAGCCCTTGATGACTGATTGGATTTCC^ 



S G A L O O • 
ACAGCAAGCTGT POly(A)* 
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Fig.Sb. 

(Cctb) 

pTbeta2 cDNA Sequence 



GGCTTCCCGGCGCGGATTGTGAGGGGTTCTCTCCTCTACGCGGAATCCGTCGGAACCATGGCTTCCCTTTCCCTCGCACCTGTTAATATC 

MASLSLAFVNX 

TTCAAGGCTGGAGCTGATGAAGAGAGGGCCGAGATAGCTCGCCTGTCGTCGTTTATCGGTGCCATCGCCATTGGAGACTTGGTGAAGAGC 
FKAGADEERAE I ARLSSF IGAIAI GDLVKS 

ACTTTGGGACCGAAGGGCATGGACAAAATTCTCCTAAGCAGTGGACGAGACGCCGCTCTGATGGTGACCAACGACGGTGCTACCATTCTC 
TLGPKGMDKILLSSGRDAALMVTNDGATIL 

AAGAACATTGGTGTGGACAACCCCGCAGCAAAGGTTCTAGTTGATATGTCAAGGGTTCAAGATGATGAAGTTGGTGATGGCACTACCTCT 
KNIGVONPAAKVLVDMSRVQDDEVGDGTTS 

GTTACTGTCTTAGCCGCAGAGCTGCTCCGGGAAGCAGAATCTTTAATTGCAAAAAAGATACATCCACAGACCATCATCTCAGGTTGGAGA 
VTVLAAELLREAESLIAKKIHPQT I ISGWR 

GAAGCCACAAAGGCAGCAAGAGAGGCCCTGCTGAGCTCCGCTGTGGATCATGGTTCTGATGAAGCCAGATTCTGGCAGGACTTAATGAAC 
EATKAAREALLS SAVDHGSDEARFWQDLMN 

ATTGCAGGAACGACATTGTCCTCAAAGCTCCTTACTCACCAOVAGGACCyiCrrTTACTAAACTGGCCGTGGAAGCGGGTCTCAGAC 
lAGTTLSSKLLTHHKDHFTKLAVEAGLRLK 

GGCTCTGGCAACCTGGAGGCGATTCATGTCATCAAGAAACTAGGTGGGAGTCTGGCAGACTCCTATCTAGATGAAGGTTTTCTTTTGGAT 
GSGNLEAIHVIKKLGGSLADSYLDEGFLLD 

AAAAAAATTGGAGTAAATCAACCAAAGAGAATTGAAAATGCTAAAATTCTTATTGCAAATACTGGGATGGATACAGACAAAATAAAGATA 
KKIGVNQPKRIE NAKILIANTGMDTDKIKI 

TTTGGCTCTCGGGTAAGAGTTGATTCCACAGCAAAGGTTGCAGAGATAGAACATGCAGAAAAGGAGAAGATGAAGGAGAAAGTTGAAC 
FGSRVRVDSTAKVAEIEHAEKEKMKEKVER 

ATTCTTAAGCATGGAATAAATTGCTTTATTAACAGACAGTTAATTTATAATTACCCTGAACAACTCTTCGGCGCTGCTGGCGTCATGGCT 
ILKHGINCFINRQLIYKYPEQLFGAAGVMA 

ATTGAGCATGCGGATTTCGCAGGTGTGGAGCGCCTCGCTCTTGTCACAGGTGGTGAGATTGCCTCTACCTTTGATCACCCAGAACTTGTG 
lEHADFAGVERLALVTGGEIASTFDHPELV 

AAGCTTGGAAGTTGCAAACTTATTGAAGAAGTTATGATCGGGGAAGATAAACTCATTCACTTTTCTGGGGTTGCCCTTGGTGAGGCATGC 
KLGSCKLIEEVMIGEDKLIHFSGVALGEAC 

ACCATTGTGCTTCGTGGTGCCACTCAGCAAATTCTGGATGAAGCTGAACGATCTCTGCATGATGCTCTTTGT6TTCTTGCTCAGACTGTA 
TIVLRGATQQILDEAERSLHDALCVLAQTV 

AAAGATCCTAGAACAGTTTACGGGGGAGGCTGCTCTGAGATGCTGATGGCCCATGCTGTGACACAGCTTGCCAACAGAACCCCAGGAAAA 
KDPRTV YGGGCSEMLMAHAVTQLANRTPGK 

GAAGCTGTAGCAATGGAGTCGTTTGCAAAAGCCCTGAGAATGTTGCCGACCATCATAGCCGACAATGCGGGCTATGACAGTGC^ 
EAVAMESFAKALRMLPTI lADNAG YDSADL 

GTGGCACAGCTCCGAGCTGCTCACAGTGAAGGCCATATAACTGCTGGACTGGATATGAAGGAAGGTACCATCGGCGATATGGCAGTACTG 
VAQLRAAHSEGHITAGLOMKEGTI GDMAVL 

GGTATAACAGAGAGTTTTCAAGTGAAGCGACAGGTTCTTCTGAGTGC6GCTGAAGCAGCAGAGGTGATCCTGCGAGTGGACAACATTATC 
GITESFQVKRQVLLSAAEAAEVILRVDNII 

AAAGCAGCACOlAGGAAACGTGTCCCCGATCyVCCACCCCTGTTAAGCATTCCCATTTGCTGATGAACTCTGGGCCAGTTCATA^ 
KAAPRKRVPDHHPC. 

TGTACTTGGAAGACTTCAACCTTTAAAGAAGACTGGTGGAATTGACCTGTCCATGATAGCCTTAAGTTTGAACATTAGCTGACCTT^^ 



gttaaacatgggtctaatttatttactgtttcattttccatacaattcagttgatttacaagttcatttctcatactgtgtatta;^ 



AAAATCCAGTTACTTAGCCCTTAAAAAAAAAAAA 
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Fig.Sc. 

(Cctg) 

pTgamma? cDNA Sequence 

TGTCATTCCTGAGGAGGCAGCGTTTTCTTCGCTGCTCTTCTCTCCAGAAGGTTCTGCCGAWCCCCCCAGCTCTGGA 



CGTCGTGCCGCCATGATGGGCCACCGTCCAGTGCTCGTGCTCAGTCAGAACACAAAGCGTGAATCTGGAAGAAAAGTTCAGTCTGGAAAT 
MMGHRPVLVLSQKTKRESGRKVQSGK 

ATCAATGCTGCAAAGACAATTGCAGACATCyiTCCGGACCTGCTTGGGACCTAAATCTATGATGAAGATGCTTTTGGACCCAATGGGAGGC 
IKAAKTIADIIRTCLGPKSMMKMLLDPMGG 

ATCGTGATGACCAATGATGGCAATGCCATTCTTCGAGAGATTCAAGTCCAGCATCCCGCAGCAAAGTCCATGATTGAAATTAGCAGGAC 
IVMTNDGNAILREIQVQHPA-AKSMIEISRT 

CAGGATGAAGAGGTTGGAGATGGGACCACATCAGTAATTATTCTTGCGGGAGAAATGCTCTCTGTGGCTGAACACTTCCTAGAGCAGCAG 
QOEEVGDG TTSVIILAGEMLSVAEHFLEQQ 

ATCCACCCAACAGTGGTGATCAGTGCTTACCGCATGGCACTGGATGATATGATCAGCACTCTGAAGAAAATCAGTACTCCTGTTGATGTC 
MHPTVVISAYRMALDDMISTLKKISTPVDV 

AATAACCGTGAGATGATGTTGAGCATCATCAATAGCTCTATTACTACAAAAGTCATCAGTCGGTGGTCCTCCTTGGCATGCAACATTGC^ 
NHREMMLS 1 ZKSS ITTKVISRWSSLACN lA 

CTGGATGCTGTTAAGACTGTGCAGTTTGAAGAGAATGGCAGAAAGGAAATTGACATCAAAAAATATGCAAGGGTAGAAAAGATACCCGGG 

ldavktvqfeengrkeidikkyarvekipg 

ggcatcattgaagactcatgtgtcttacgtggagttatgattaacaaggatgtgacccatccaaggatgcgccgctatattaagaatcct 
giiedscvlrgvminkdvthprmrryi knp 

cgaattgtgctattggattcttctttggagtacaagaaaggagaaagccagaccgacatcgagattacccgggaggaggacttcacgcgg 
rivlldssleykkgesqtdieitreedftr 

atcctgcagatggaggaggagtacatccatowsctgtgtgaggacatcatcoigctgaagcctgacgtggtcatcacagag;^ 
ilqmeeeyihqlcediiqlkpdvvitekgi 

tcagatttagctcagcactacctcatgcgggccaatgtcacagccattcgtagagtccggaaaacagacaataat 

SDLAQHYLMRANVTAIRRVRKTONNRIARA 

tgtggggcacggatagtcagccgacctgaggaactgagagaagatgatgttggtacaggggcaggcttattggaaatcaagaagattgga 
cgarivsrpeelreddvgtgaglleikkig 

gatgagtactttacattcatcactgactgcaaagacccaaaggcctgcaccattcttcttagaggagcoigcaaagagatactc 
deyftfitdckdpkactillrgaskeilse 

gtagaacgcaacctccaggatgccatgcaagtgtgccgcaatgttctactggaccctcagctggtgcctggtggtggagcctctgagat^ 
vernlqoamqvcrhvlldpqlvpggg asem 

gctgtggctcatgccttgaoigaaaaatctaaggccatgactggtgtggaacaatggccatatagagctgtggcccaggcgtta 
avahaltekskamtgveqwp yravaqalev 

atccctcggaccttgatccagaattgtggggccyigtaccattcgtctgcttacctcccttcgggccjulgcacacatt^ 
iprtliqncgastirlltslrakhtqesce 

acctggggtgtgaatggtgagactggtaccttggtggacatgaaagagctgggtatttgggagccattggctgtgaagctacaaacgtac 
twgvngetgtlvdmkelgiwep lavklqty 

aaaacagcagtagagactgcagttctgcttctgcggattgatgacattgtctctggccacaagaagaagggtgatgaccagaaccggcaa 

KTAVETAVLLLRIDDIVSGHKKKGDDQNRQ 

ACTGGTGCTCCAGATGCTGGCCAGGAGTGAGTGCTGAGCyiCGGTGACTTCCATGCACAGAACCAGCAGTCTCCCCTTTCCTGAGCCAGAG 
TGAPDAGQE 

TTCCAGGAACACTGTGGACATCTTTGTTTGCGAAGGATCAGGTTGAGGGGCAGCCCCCAGTCTGTCCCATCTCAGTTTGCAAAAA^ 



gacacgtatctcttctattgtaagctttccatttagtttgcttccaatgattaaatctaagtcatttgaaaaaaaa;^^ 
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Fig.Sd. 

(Cctd) 

pTd0lta2 cONA Sequence 

GGCTCTCCAGCAGGCCCCTTGCTCGGCTTCCGCCTTCTCTCCCGGCGTCGCTTTCTGGAAGGTTCGTGGAGGAGGCGGTCAGGGAGACC 



GTTACTCCACAGCAAGCCGGAATCCGTGTCCATCCGTCCTCCTGAACCCGCGCAGACGCCACCAAGGTCGCCATGCCGGAGAACGTAGCT 

M P E N V A 

TCCCGAAGCGGGGCGCCCACCGCCGGGCCCGGCAGCCGCGGGAAAAGCGCCTACCAGGACCGCGACAAGCCAGCCCAGATCCGCTTCAGC 
SRSGAPTAGPGSRGKSAYQDRDKPAQIRFS 

AATATTTCCGCGGCCAAAGCGGTTGCTGATGCTATTAGAACAAGCCTTGGACCTAAAGGAATGGACAAAATGATTCAAGATGGAA^ 
NISAAKAVADAIRTSLGPKGMDKMIQDGKG 

GATGTGACCATTACAAATGATGGTGCCACCyiTTCTGAAACAAATGCAGGTATTGCATCCAGCAGCaiGAATGCTGGTGGAACT 
DVTITNDGATILKQM QVLHPAARMLVELSK 

GCTCAAGACATAGAAGCAGGAGATGGCACCACGTCGGTTGTCATCyiTTGCTGGCTCTCTTTTAGACTCCTGTACCAAACTT^ 
AQDIEAGDGTTSVVIIAGSLLDSCTKLLQK 

GGTATACATCCAACCATCATTTCCGAGTCATTCCAGAAAGCTTTGGAAAAGGGTCTTGAAATCCTTACTGACATGTC 
GIHPTIISESFQKALEKGLEILTDMSRPVQ 

CTGAGCGACAGAGAAACTTTGTTAAATAGCGCAACTACTTCATTGAATTCAAAGGTTGTCTCTCAGTATT^ 

LSORETLLNSATTSLKSKVVSQYSSLLSPM 
AGTGTCAATGCGGTGATGAAAGTGATTGACCCyiGCCACAGCTACCAGTGTAGATCTTCGAGAT^^ 

SVNAVMKVIDPATATSVDLRDIKIVKKLGG 
ACAATAGATGACTGTGAGCTGGTGGAAGGCCTCGTTCTCACACAGAAAGTAGCAAATTCTGGCATAAO^ 

TI 0DCELVEGLVLTQKVANS61TRVEKAKI 
GGGCTTATTCAGTTTTGCTTATCTGCTCCTAAAACAGATATGGATAATCAAATAGTAGTAT^ 

GLIQFCLSAPKTDMONQIVVSDYAQMDRVL 

CGAGAGGAGAGAGCCTATATTTTAAATTTGGTGAAGCAAATTAAGAAAAaVGGATGTAATGTCCTTCTOlTACAG^^ 
REERAYILNLVKQIKKTGCNVLLIQKSI LR 

GATGCCCTTAGTGATCTTGCATTACATTTTCTGAATAAGATGAAGATTATGGTGGTTAA^^ 

DALSDLALHFLKKMKIMVVKDVEREDIEFI 
TGTAAGACAATTGGAACCAAACCAGTTGCTCACATTGACOIGTTCACTGC^ 

CKTI GTKPVAfllDQFTADMLGSAELAEEVS 
WAJUITGGTTCTGGAAAACTATTCAAGATTACAGGTTGTAC^ 

LNGSGKLFKITGCTSPGKTVTIVVRGSNKL 
CTGATTGAAGAAGCTGAGCGCTCCyVTTCATGATGCTCTCTGTGTOlTCCGATG 

VI EEAERS IHDALCVI RCLVKKRALIAGGG 

GCTCCAqAAATAGAGCTGGCCCTCAGACTGACAGAGTACTCCCGAACACTGAGTGGTATGGAGTCCTACTGTGTTCGTGCTT^ 
APEIELALRLTEYSRTLSGMESYCV. RAFAO 

GCTATGGAAGTCATTCCATCTACACTAGCTGAAAA 

AMEVIPSTLAENAGLNP ISTVTELRNRHAQ 
TCAGAAAAAACTACAGGCATTAATGTCC6AAAGGGTGGGATCTCCAACATTTTG 

GEKTTGINVRKGGISN ILEEMVVQPLLVSV 

AGTGCTTTGACCTTAGCAACTGAAACTGTGCGGAGCATTCTGAAAATCGATGATGTGGTAA^ 
5ALTLATETVRSILKIDDVVNTR 

T6ACTGCATCATTATGGACAGAAGTACTGTGGCTGGAATGAAGGACAACCACCTTGTTCCTO 



TGTCTTCCAGTTGGCATTTGTTTGAAATTGTATTGAAACyuiTTTAATGAAAACATTAAATAOT 



AAAAAAA 
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Fig.Se. 

(Ccte) 

pTepsilonS cDNA Sequence 
«;ATTCTGCGTCCTCTCGCGAGAa:GAGGTGCGCrrCTGTCTCTCTTCCCGAGT<K5TCCCCGCTGGTTACTGGAGGAGCGCTCGTTCTAG 
TCGTCCACCATGGCGTCCCTGGGGACCCTCKCWCGATG^^^^^ 

CTCATGGGGCTTGAGGCCCTCAAGTCTCACAT^^^^ 

GACAAGATGATGGTTGATAAGGATGGCGAT^^^ 

GCCAAG«GATGGTTGAACTGTCCAAAT^^^^ 

GAAGAAGCTGAACAGCTGCTCGACCGAGGCATTC^^^^ 

CTGGACAAGATCAGCGATAAAGTGCTTM^^^ 

ATTAACAGCTGTCACCGACAGATGGCT^^^^ 

CTCATTAAAGTGGAAGGCAAAGTAGGTG^ 

w^A-r,uxKi,IKGVlVDKDrSHP 
CAGATGCCG^UUAAAGTGGTAGAI^^ 

GTCATGTCTGTGGAGGACTACAAAGCCC^^^ 

AACCTAGCTATTTGCCAGTGGGGCTTTGACGATG^^^ 

GTAGGGGGA 

wwtAWHLLLQNGLPAVRWVGG 
CCTGAGATTGAGCTGATCGCCATTGCAACAG^ 

GTGGTGCAGGAGATCTCCTTTGGCACTACAA^^ 

AGAGGAGGAAACAAGATGATCATAGAAC«^ 

GTTGTGTATGGAGGAGGGGCAGCCGAAATATCCT^^^^ 

ATGAGAGCTTTTGCAGATGCC^GGAGGTCA^^^ 

«**i,v J-*'«ALSENSGMNPIQTMTEV 
CGAGCCAGACAGGXGAAGGAGTCTAACCCTGCCW 

OAAACCTTGATTGGC^GCAGCAGATCTCTCM^^^ 

TAAGCCTGGA 

'^'^y^^RMILKlDDlRKPG 
GAATCTGAAGAATAAACTGTACCATTTACCACTGTGACTAAATAAAGGGTGTGTCT^ 
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(Cctz) 



pTzetalZ cDNA sequence 



gaagaccccgcagagagcacgttgttctcggcgtctcccggagctaggccagccatggcggcggtaaagaccctaaatccgaaggccgag 

maavktlnpkae' 

gtggcccgggcccaggcagcgctggcggtgaacatcagcgcggctcggggcctgcaggatgttctgaggaccaacttggggcctaagggc 

VARAQAALAVNISAARGLQDVLRTKLGPKG 

ACCATGAAGATGCTTGTATCTGGTGCTGGAGACATCAAACTTACTAAAGATGGCAATGTGCTGCTTCATGAAATGCAAATTCAACACCCA 
TMKMLVSGAGDIKLTKDGNVLLHEMQIQHP 

ACAGCCTCTTTGATAGCAAAAGTGGCTACAGCCCAGGATGACATAACTGGCGATGGCACTACATCCAATGTCCTCATCATCGGGGAGCTG 
TASLIAKVATAQDDITGDGTTSNVLIIGEL 

CTCAAACAGGCGGACCTGTACATTTCTGAAGGTCTTCACCCAAGAATAATAACTGAAGGTTTTGAAGCGGCAAAAGAAAAGGCACTCCAA 
LKQADLYISEGLKPRIITEGFEAAKEKALQ 

TTTCTGGAACAAGTCAAAGTAAGCAAAGAGATGGACAGAGAAACACTCATCGATGTGGCCAGGACATCTCTGCGGACTAAAGTTCA^ 
FLEQVKVSKEMDRETLIDVARTSLRTKVHA 

GAACTTGCAGATGTCTTGACAGAGGCTGTAGTGGACTCCATCTTGGCCATTAGGAAAAAGGACGAGCCCATTGACCTCTTCATGGTTGAG 
ELAOVLTEAVVDS ILAIRKKDEPIDLFMVE 

ATCATGGAGATGAAGCATAAATCTGAGACAGATACAAGCTTAATCAGAGGGCTTGTTTTGGATCATGGAGCTCGGCATCCTGATATGAAG 
IMEMKHKSETDTSLIRGLVLDHGARHPDMK 

AAGAGAGTGGAAAATGCCTACATCCTCACGTGCAACGTGTCCTTAGAGTATGAGAAAACAGAAGTGAATTCTGGGTTTTTTT 
KRVENAYILTCNVSLEYEKTEVKSGFFYKS 

GCAGAAGAGAGAGAAAAACTAGTAAAGGCTGAAAGAAAATTCATTGAAGATAGAGTTAAAAAAATCATAGAGCTGAAAAAGAAAGTCTGT 
AEEREKLVKAERKFIEDRVKKI lELKKKVC 

GGTGACTCAGATAAAGGATTTGTCGTTATTAATCAAAAGGGGATTGACCCCTTTTCCTTAGATGCCCTTGCGAAAGAAGGGATCGTAG^^ 
GDSDKGFVVINQKGIDPFSLDALAKEGIVA 

CTGCGCAGAGCa^GAGGAGAAACATGGAGAGGCTGAOVCTTGCTTGTGGTGGGATAGCTCTGAATTCCTTTGATGACCTGAATCCTGA 
LRRAKRRNMERLTLACGGIALNSFDDLNPD 

TGTTTGGGACATGCAGGGCTTGTCTATGAGTATACACTGGGTGAGGAGAAGTTCACCTTTATTGAGAAGTGTAACAATCCC 
CLGHAGLVYEYTLGEEKFTF lEKCNNPRSV 

ACTTTACTGGTTAAAGGACCAAATAAGCACACACTGACTCAAATOUIGGATGCAATAAGAGATGGCTTGAGGGCTGTCA^^ 
TLLVKGPNKHTLTQIKDAIRDGLRAVKNAI 

GATGATGGCTGTGTTGTCCCAGGTGCGGGTGCAGTAGAAGTGGCACTGGCAGAAGCTCTGATTAAATACAAGCCCAGTGTGAAGGGCAGG 
DOGCVVPGAGAVEVALAEAL IKYKPSVKGR 

GCGCAGCTTGGAGTCCAGGCATTTGCAGATGCCTTGCTCATCATTCCCAAGGTTCTTGCGCAAAACTCTGGTTTTGACCTO^ 
AQLGVQAFADALLIIPKVLAQN SGFDLQET 

TTAGTTAAAGTTCAAGCTGAACATTCAGAATCGGGCCAGCTCGTAGGTGTGGATCTGAGCACAGGTGAGCCGATGGTGGCCGCAGAGATG 
LVKVQAEHSESGQLVGVDLSTGEPMVAAEM 

GGTGTGTGGGATAACTACTGTGTGAAGAAGCAGCTGCTACACTCCTGTACTGTGATCGCOICCAACATTCTCCTGGTCGACGAGATCATG 
GVWDNYCVKKQLLHSCTVIATNILLVOEIM 

CGAGCTGGGATGTCCTCTCTGAAGGGTTGAGGCCTGCCTGTGATACTACAGGATGTTGGGGGGAATGGTTATTTTTG^ 
RAGHSSLKG 

TGATTTGGAAAAAAATTTTCTCTTCCTGATTGGAGAAAAGAAACGGGACATTTGACACCTATTCAAATTATACTGTAA^ 
TTTTTGCCTTGAGTATCTGAAGACACTCAAAGCAGCTCTTTTTaiACCCACTGAA^ 

TAATAAGATAAGCATGTTGTCTACCCTTGTTCCATAAGTGTTCTTTGAAAGTTTGTAATGGTTTTCTCCTAAATAAGGCA^ 
TGCCTGTAAGCCTAGCCCTTTGGAAATAGTCCGGAATTTCTATGCCAACTCAGGCTACAGGAGACCCCAGGTCGAAAGAAT^^ 
TGGATGTATTTGAAATTATCCAGCCAACTCCCTGTTAAACATGTAAGATCCTTGCCAGTGTAAAACACATCTGGG 
ATAATCTCTAATAAATACTTAAAAGAGTGAAAAAAAAAAAAAAAA 
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Fig.Sg. 

(Ccth) 

pCBLSO cDNA sequence 

AGATGAT<»:CACACCACTTATCCTCTTGA^ 

HMPTPVILLKEGTDSSQGIPQLVSNISAC 
CAAGTGATTGCTGAGGCTOTAAG^ 

UVIAEAVRTTLGPRGHDKLIVDGRGKATIS 
AAO^TGGGGOCACaiATTCTGAAAC^^ 

NDGATILKLLDVVHPAAKTLVDIAKSQDAE 
GTTGGTGATGGCACCACCTCAGTGACOT 

VGDGTTSVTLLAAEFLKQVKPYVEEGLHPQ 
ATCATCATCCGAGCrnCCGCACyiTCC^^ 

IRAFRTATQLAVNKIKEIAVTVKKQDKV 
GAGCAGAGGAAGATGCTGGAJ^TGTGCGA^ 

EQRKMtEKCAMTALSSKLISQQKVFFAKMV 
GTTGATGCCGTGATGATC^GACGAC^GCT^^ 

VDAVMHLDELLQLKHIGIKKVQGGALEESQ 
CTAOTGCTGGTGTTGCCTTC*^ 

uvAevAFKKTFSYAGFEMQPKKY KNPKIAL 
TOAAATGTTGW^GW^GAAAG^ 

l-riVEiELKAEKDNAElRVHTVEDYQAIVDA 
GAGTGGAATATTCTCTATGA<^AGCT 

t.W«II.YDKLEKIHQSGAKVII,SKLPIGDVA 
ACCCAGTACTTTGCTGATAGGGACATGTTCTGTG^ 

*^*°^^"'^CAGRVPEEDLKRTMMACGGS 
ATCCAGACCAGTG«^TCCTCTGGTT^ 

iUiS.VNAI,VPDVLGHCQVFEETQlGGERYK 
TTCTTCACTGGCTGCCC^^ 

i».i.fKAKTCTI II-RGGAEQFMEETERSL 
CATGATGCCATCATGATTCTC^GGAGGGCCA^^ 

HDAIMIVRRAIKNDSVVAGGGAIEMELSKY 
CTGCGGGATTACTCGAGGACCATTC^^ 

i-RDTSRTI PGKQQl.i,iGAYAKALEIlPRQt. 
TGrGA^GCTGGCTTTG^ 

w««AOfBATNlLNKLRARHAQGGMWYGVDI 
AACAATGA(»ACATCGCCGACAA^ 

««tH lADNFQAFVWEPAMVRINALTAASEA 
GCATGCOTATTGTGTCCGTGGATG^ 

A1.1.IVSVDET1KNPRSTV0PPAPSAGRGRG 
CAAOCCCGCTTCCACTGM^^ 

CTCACTGAGGTTATTTAAATAAAGC 
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Fig.8h. 

(Cctq) 

pTthetal cDNA sequence 



CCCATTCCOlGCAGGCAGTGCGCGGCGCCCCTTGCTTCCTCGGCGTCTCGGAGCGAGCGCGTGAGCTC^ 

M 

GCGCTTCACGTCCCCAATCCCCCGGGCTTTGCCCAGATGCTCAAGGATGGAGCCAA^ 

ALHVPKAPGFAQMLKDGAKHF SGLEEAVYR 
AATATACAGGCCTGCAAGGAGCTTGCTttGACTACTCGCACAGCGTACGGACCAAATGGAA 

NIQACKELAQTTRTAYGPNGMKKMVINRLE 
AAGTTGTTTGTGACAAACGATGCAGCGACTATTTTAAGAGAGCTAGAAGTGCAGCATCCTGCTG^ 

KLFVTNDAATILRELEVQHPAAKMIVMASfi 
ATGCAAGAACAAGAGGTTGGTGATGGCACAAACTTCGTTCTGGTGTTCGCTGGGGCTCTO 

MQEQEVGDGTNFVLVFAGALLELAEELLRI 
GGCCTGlXyiGTATCAGAGGTAATATCGGGGTATGAAATAGCTTGCAAAAAAGCTCATGAC^^ 

GLSVSEVISGYEIACKKAHEILPELVCCSA 
AAAAACCTTCGAGATGTTGATGAAGTGTCCTCTCTGCTTCGAACCTCCATCATGAGTAAGCAG^^ 

KNLRDVDEVSSLLRTSIMSKQYGSETFLAK 
CTTATTGCTCAGGCTTGTGTTTCTATCTTTCCTGATTCTGGCAATTTCAATC 

LIAQACVSIFPDSGKFNVDKIRVCKILGSG 
ArrTATTCATCCTCAGTATTACATGGCATGGTTTTTAAGAAGGAAACTG^ 

lYSSSVLHGMVFKKE TEGDVTSVKDAKIAV 
TACTCTTGTCCGTTTGATGGCATGATAACAGAGACAAAGGGGACCGTGCTGATTAA 

YSCPFDGMITETKGTVLIKTAEELMNFSKG 
GAGGAGAACCTCATGGATGCTCAGGTGAAGGCCATTGCAGG^ 

EENLMDAQVKAIAGTGANVIVTGGKVADIA 
CTTCATTATGCTAAOU^ 

LHYANKYNIMLVRLNSKWDLRRLCKTVGAT 
GCTOTCCAAAATTGACTCCTCCCGTCC^ 

ALPKLT PPVQEEMGHCDSVYLS EVGDTQVV 
GTTTOAAGCATGAAAAAGAAGATGGT^ 

VFKHEKEDGAISTIVLRGSTDNLMDDIERA 
GTAGATGATGGACTTAATACTTTCAAAGTTCTCAC^ 

VDCGVNTFKVLTRDKRLVPGGGATEIELAK 
CAAATCACATCATATGGAGAGACGTGTCCTGGGCTTGAACAGTATGCTATTAAGAAGT^^ 

QITSYGETCPGLEQYAIKKFAEAFEAIPRA 
CTGGCAGAAAATTCTGGCGTGAAGGCCAATGAAGTTATCTCTA^ 

t.AENSGVKANEVI SKLYSVHQEGNKNVGLD 
ATCGAGOTGAAGTCCCTGCTGTAAAGGATATGTTAGAAGCCAGTATm 

I EAEVPAVKDMLEASILDTYLGKYWAIKLA 
JpriATGCTGCAGTCACTGTACTAAGAGTGGATCAGATCATCATGG 

TNAAVTVLRVDQIIMAKPAGGPKPPSGKKD 



W D D D Q N D . .«^w.._ww.«.«...rATTTTTCTTA 

TTTGTGTTTGTACTCTGCTGGGTGGTACAATAAATGTGTTACTGTTAAAAAAAAA 
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Fig. 1 1b, 
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Fig. 13a. 
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